Polyketides and their synthesis

Field of Invention 

The present invention relates to processes and materials (including recombinant 
strains) for the preparation and isolation of macrolide compounds, particularly compounds 
differing from natural compounds at least in terms of glycosylation. It is particularly 
concerned with erythromycin and azithromycin analogues wherein the natural sugar at the 5- 
position has been replaced. The invention includes the use of recombinant cells in which 
gene cassettes are expressed to generate novel macrolide antibiotics. 



Background to the Invention 

The biosynthetic pathways to the macrolide antibiotics produced by actinomycete
bacteria generally involve the assembly of an aglycone structure, followed by specific
modifications which may include any or all of: hydroxylation or other oxidative steps,
methylation and glycosylation. In the case of the 14-membered macrolide erythromycin A
these modifications consist of the specific hydroxylation of 6-deoxyerythronolide B to
erythronolide B, which is catalysed by EryF, followed by the sequential attachment of
mycarose via the hydroxyl group at C-3, catalysed by the mycarosyltransferase EryBV
(Staunton and Wilkinson, 1997). The attachment of desosamine via the hydroxyl group at
C-5, catalysed by EryCIII, then results in the production of erythromycin D, the first
intermediate with antibiotic activity. Erythromycin D is subsequently converted to
erythromycin A by hydroxylation at C-12 (EryK) and O-methylation (EryG) on the
mycarosyl group, this order being preferred (Staunton and Wilkinson, 1997). The
biosynthesis of dTDP-L-mycarose and dTDP-D-desosamine has been studied in detail
(Gaisser et al., 1997; Summers et al., 1997; Gaisser et al., 1998; Salah-Bey et al., 1998).

Recently, a 3.1 Å high-resolution X-ray investigation of the interaction of ribosomes
with macrolides (Schlunzen et al., 2001; Hansen et al., 2002) has revealed key interactions
giving direct insights into ways in which macrolide templates might be adapted, by chemical
or biological approaches, for increased ribosomal binding and inhibition and for improved
effectiveness against resistant organisms. In particular, previous indications about the
importance of the sugar substituent at the C-5 hydroxyl of the macrocycle for ribosomal
binding are fully borne out by the structural analysis; this substituent extends towards the
peptidyl transferase centre and, in the case of 16-membered macrolides, which bear a
disaccharide at C-5, reaches further into the peptidyl transferase centre, thus providing a
molecular basis for the observation that 16-membered macrolides inhibit ribosomal capacity
to form even a single peptide bond (Poulsen et al., 2000). This suggests that erythromycins
with alternative substituents at the C-5 position, for example mycaminosyl and
angolosaminyl erythromycins, and in particular mycaminosyl and 4'-O-substituted
mycaminosyl erythromycins, are highly desirable as potential anti-bacterial agents.
Since post-polyketide synthase modifications are often critical for biological activity
(Liu and Thorson, 1994; Kaneko et al., 2000), there has been increasing interest in
understanding the mechanism and specificity of the enzymes involved, in order to engineer the
biosynthesis of diverse novel hybrid macrolides with potentially improved activities. Recent
work has demonstrated that the manipulation of sugar biosynthetic genes is a powerful
approach to isolate novel macrolide antibiotics. The recently demonstrated relaxed specificity
of the glycosyltransferases is crucial for this approach (see Mendez and Salas, 2001 and
references therein). In the pathways to erythromycin A and methymycin / neomethymycin,
the production of hybrid macrolides has been observed after inactivation of specific genes
involved in the biosynthesis of deoxyhexoses (Gaisser et al., 1997; Summers et al., 1997;
Gaisser et al., 1998; Salah-Bey et al., 1998; Zhao et al., 1998a; Zhao et al., 1998b) or after
the expression of genes from different biosynthetic gene clusters (Zhao et al., 1999). A
relaxed specificity towards the sugar substrate has also been reported for glycosyltransferases
that have been expressed in heterologous strains, including glycosyltransferases from the
pathways to vancomycin (Solenberg et al., 1997), elloramycin (Wohlert et al., 1998),
oleandomycin (Doumith et al., 1999; Gaisser et al., 2000), pikromycin (Tang and McDaniel,
2001), epirubicin (Madduri et al., 1998), avermectin (Wohlert et al., 2001) and spinosyn
(Gaisser et al., 2002a). Most of the successful alterations so far reported have involved
relaxed specificity towards the activated sugar moiety, while as yet only isolated examples are
known where a glycosyltransferase targets its deoxysugar to an alternative aglycone substrate
(Spagnoli et al., 1983; Trefzer et al., 1999). Both WO 97/23630 and WO 99/05283 describe
the production of erythromycins with an altered glycosylation pattern in culture supernatants
by deletion of a specific sugar biosynthesis gene. Thus WO 99/05283 describes low but
detectable levels of 5-O-dedesosaminyl-5-O-mycaminosyl erythromycin D in the culture
supernatant of an eryCIV knockout strain of S. erythraea. It has also been demonstrated that
the use of the gene cassette technology described in patent WO 01/79520 is a powerful and
potentially general approach to isolate novel macrolide antibiotics by expressing
combinations of genes in mutant strains of S. erythraea (Gaisser et al., 2002b). WO 01/79520
also describes the detection of 5-O-dedesosaminyl-5-O-mycaminosyl erythromycin A in
culture supernatants of the S. erythraea strains SGQ2pSGCIII and SGQ2p(mycaminose)CIII,
fed with 3-O-mycarosyl erythronolide B. However, the low levels of 5-O-dedesosaminyl-5-O-
mycaminosyl erythromycin A make this a less than optimal method for producing this
valuable material on a large scale, and similar problems were encountered in synthesizing
5-O-dedesosaminyl-5-O-mycaminosyl erythromycin A using chemical methods (Jones et al.,
1969). EP 1024145 refers to the isolation of azithromycin analogues carrying a mycaminosyl
residue such as 5-O-dedesosaminyl-5-O-mycaminosyl azithromycin and 3' 5 -desmethyl-5-O-
dedesosaminyl-5-O-mycaminosyl azithromycin. However, the only examples given in this
area are "prophetic examples" and there is no evidence that they could actually be put into
practice.

Therefore, the present invention provides the first demonstration of an efficient and
highly effective method for making significant quantities of erythromycins and azithromycins
which have non-natural sugars at the C-5 position, in particular mycaminose and
angolosamine. In a specific aspect the present invention provides for the synthesis of
mycaminose and angolosamine using specific combinations of sugar biosynthetic genes in
gene cassettes.


Summary of the Invention 

The present invention relates to processes, and recombinant strains, for the
preparation and isolation of erythromycins and azithromycins, which differ from the
corresponding naturally occurring compound in the glycosylation of the C-5 position. In
particular, the present invention relates to processes and recombinant strains for the
preparation and isolation of 5-O-dedesosaminyl-5-O-mycaminosyl, or angolosaminyl
erythromycins and azithromycins, in particular 5-O-dedesosaminyl-5-O-mycaminosyl
erythromycins and 5-O-dedesosaminyl-5-O-mycaminosyl azithromycins, and specifically
5-O-dedesosaminyl-5-O-mycaminosyl erythromycin B, 5-O-dedesosaminyl-5-O-mycaminosyl
erythromycin C, 5-O-dedesosaminyl-5-O-mycaminosyl erythromycin D,
5-O-dedesosaminyl-5-O-mycaminosyl erythromycin A, and 5-O-dedesosaminyl-5-O-mycaminosyl
azithromycin.
The present invention further relates to novel 5-O-dedesosaminyl-5-O-mycaminosyl or
angolosaminyl erythromycins and azithromycins produced thereby.

Detailed description of the Invention

The present invention relates to processes, and recombinant strains, for the
preparation and isolation of erythromycins and azithromycins which differ from the naturally
occurring compound in the glycosylation of the C-5 position. These are referred to herein as
"compounds of the invention" and unless the context dictates otherwise, such a reference
includes a reference to 5-O-dedesosaminyl-5-O-mycaminosyl erythromycins,
5-O-dedesosaminyl-5-O-angolosaminyl erythromycins, 5-O-dedesosaminyl-5-O-mycaminosyl
azithromycins, and 5-O-dedesosaminyl-5-O-angolosaminyl azithromycins, specifically
5-O-dedesosaminyl-5-O-mycaminosyl erythromycin A, 5-O-dedesosaminyl-5-O-mycaminosyl
erythromycin C, 5-O-dedesosaminyl-5-O-mycaminosyl erythromycin B,
5-O-dedesosaminyl-5-O-mycaminosyl erythromycin D, 5-O-dedesosaminyl-5-O-mycaminosyl
azithromycin, 5-O-dedesosaminyl-5-O-angolosaminyl erythromycin A,
5-O-dedesosaminyl-5-O-angolosaminyl erythromycin B, 5-O-dedesosaminyl-5-O-angolosaminyl
erythromycin C, 5-O-dedesosaminyl-5-O-angolosaminyl erythromycin D,
5-O-dedesosaminyl-5-O-angolosaminyl azithromycin and analogues thereof which additionally
vary in glycosylation at the C3 position (see WO 01/79520) and which may also vary in the
aglycone backbones (see WO 98/01571, EP 1024145, WO 93/13663, WO 98/49315). The invention
relates to processes, and recombinant strains, for the preparation and isolation of compounds
of the invention. The present invention further relates to novel
5-O-dedesosaminyl-5-O-angolosaminyl erythromycins and azithromycins produced thereby
(Figure 1). The methodology comprises in part the expression of a gene cassette in the
S. erythraea mutant strain SGQ2 (which carries genomic deletions in eryA, eryCIII, eryBV and
eryCIV (WO 01/79520)), as described in Examples 3 and 6, and in S. erythraea Q42/1
(BIOT-2166) (Examples 1-4) and S. erythraea 18A1 (BIOT-2634) (Example 6). Detailed
descriptions are given in Examples 1-11.

The invention relates to a process involving the transformation of an actinomycete
strain, including but not limited to strains of S. erythraea such as SGQ2 (see WO 01/79520)
or Q42/1 or 18A1 (whose preparation is described below), with an expression plasmid
containing a combination of genes which are able to direct the biosynthesis of a sugar moiety
and direct its subsequent transfer to an aglycone or pseudoaglycone.

In a particular embodiment the present invention relates to a gene cassette containing
a combination of genes which are able to direct the synthesis of mycaminose in an appropriate
strain background. The gene cassette may include genes selected from but not limited to
angorf14, tylMIII, tylMI, tylB, tylAI, tylAII, tylIa, angAI, angAII, angMIII, angB, angMI,
eryG, eryK and glycosyltransferase genes including but not limited to tylMII, angMII, desVII,
eryCIII, eryBV, spnP, and midI. In a preferred embodiment the gene cassette comprises
angorf14 in combination with one or more other genes which are able to direct the synthesis
of mycaminose. In a more preferred embodiment the gene cassette comprises angAI,
angAII, angorf14, angMIII, angB, angMI, in combination with one or more
glycosyltransferases such as but not limited to eryCIII, tylMII, angMII. In an alternative
embodiment the gene cassette comprises tylAI, tylAII, tylMIII, tylB, tylIa, tylMI in
combination with glycosyltransferases such as but not limited to eryCIII, tylMII and angMII.
In a preferred embodiment the strain is an S. erythraea strain.

In a particular embodiment the present invention relates to a gene cassette containing 
combinations of genes which are able to direct the synthesis of angolosamine, including but 
not limited to angMIII, angMI, angB, angAI, angAII, angorf14, angorf4, tylMIII, tylMI, tylB,
tylAI, tylAII, eryCVI, spnO, eryBVI, and errand one or more glycosyltransferase genes 
including but not limited to eryCIII, tylMI, angMII, desVII, eryBV, spnP and midI. In a
preferred embodiment the gene cassette contains angMIII, angMI, angB, angAI, angAII,
angorf14 and spnO in combination with a glycosyltransferase gene such as but not limited to
angMII, tylMI or eryCIII. In a preferred embodiment the strain is an S. erythraea strain.

In one embodiment, the process of the present invention further involves feeding of
an aglycone and/or a pseudoaglycone substrate (for definition see below), including (but not
limited to) 3-O-mycarosyl erythronolide B, erythronolide B, 6-deoxy erythronolide B,
3-O-mycarosyl-6-deoxy erythronolide B, tylactone, spinosyn pseudoaglycone, 3-O-rhamnosyl
erythronolide B and 3-O-rhamnosyl-6-deoxy erythronolide B, to cultures of the transformed
actinomycete strains, the bioconversion of the substrate to compounds of the invention and
optionally the isolation of said compounds. This process is exemplified in Examples 1-11.
However, a person of skill in the art will appreciate that in an alternative embodiment the host
cell can express the desired aglycone template, either naturally or recombinantly.

As used herein, the term "pseudoaglycone" refers to a partially glycosylated
intermediate of a multiply-glycosylated product.

Those skilled in the art will appreciate that alternative host strains can be used. A
preferred cell is a prokaryote or a fungal cell or a mammalian cell. A particularly preferred
host cell is a prokaryote, more preferably host cell strains such as actinomycetes,
Pseudomonas, mycobacteria, and E. coli. It will be appreciated that if the host cell does not
naturally produce erythromycin, or a closely related 14-membered macrolide, it may be
necessary to introduce a gene conferring self-resistance to the macrolide product, such as
ermE from S. erythraea. Even more preferably the host cell is an actinomycete, even more
preferably strains that include but are not limited to S. erythraea, Streptomyces griseofuscus,
Streptomyces cinnamonensis, Streptomyces albus, Streptomyces lividans, Streptomyces
hygroscopicus sp., Streptomyces hygroscopicus var. ascomyceticus, Streptomyces
longisporoflavus, Saccharopolyspora spinosa, Streptomyces tsukubaensis, Streptomyces
coelicolor, Streptomyces fradiae, Streptomyces rimosus, Streptomyces avermitilis,
Streptomyces eurythermus, Streptomyces venezuelae, Amycolatopsis mediterranei. In a more
highly preferred embodiment the host cell is S. erythraea.

It will readily occur to those skilled in the art that the substrate fed to the recombinant
cultures of the invention need not be a natural intermediate in erythromycin biosynthesis.
Thus, the substrate could be modified in the aglycone backbone (see Examples 8-11) or in the
sugar attached at the 3-position or both. WO 01/79520 demonstrates that the desosaminyl
transferase EryCIII exhibits relaxed specificity with respect to the pseudoaglycone substrate,
converting 3-O-rhamnosyl erythronolides into the corresponding 3-O-rhamnosyl
erythromycins. Appropriate modified substrates may also be produced by chemical
semi-synthetic methods. Alternatively, methods of engineering the erythromycin-producing
polyketide synthase, DEBS, to produce modified erythromycins are well known in the art (for
example WO 93/13663, WO 98/01571, WO 98/01546, WO 98/49315, Kato, Y. et al., 2002).
Likewise, WO 01/79520 describes methods for obtaining erythronolides with alternative
sugars attached at the 3-position. Therefore, the term "compounds of the invention" includes
all such non-natural aglycone compounds as described previously, additionally with alternative
sugars at the C-5 position. All these documents are incorporated herein by reference.

It will readily occur to those skilled in the art that the compounds of the invention
containing a mycaminosyl moiety at the C-5 position could be modified at the C4 hydroxyl
group of the mycaminosyl moiety, including but not limited to glycosylation (see also WO
01/79520), acylation or chemical modification.

The present invention thus provides variants of erythromycin and related macrolides
having at the 5-position a non-naturally occurring sugar, in particular an O-mycaminosyl or
O-angolosaminyl residue or a derivative or precursor thereof, specifically an O-angolosaminyl
residue or a derivative thereof.

The term "variants of erythromycin" encompasses (a) erythromycins A, B, C and D;
(b) semi-synthetic derivatives such as azithromycin and other derivatives as discussed in EP
1024145, which is incorporated herein by reference; (c) variants produced by genetic
engineering and semi-synthetic derivatives thereof. Variants produced by genetic engineering
include variants as taught in, or producible by, methods taught in WO 98/01571, EP 1024145,
WO 93/13663, WO 98/49315 and WO 01/79520, which are incorporated herein by reference.
The compounds of the invention include variants of erythromycin where the natural sugar at
position C5 has been replaced with mycaminose or angolosamine and also include
compounds of the following formula (I) and pharmaceutically acceptable salts thereof. No
stereochemistry is shown in Formula I as all possibilities are covered, including "natural"
stereochemistries (as shown elsewhere in this specification) at some or all positions.

Formula I:

R1 = H, CH3, C2H5 or selected from i) see below
R2, R4, R5, R6, R7 and R9 are each independently H, OH, CH3, C2H5 or OCH3
R3 = H or OH
R8 = H or [structure not reproduced; residual label: OR10] or selected from iv) see below
R10 = H or CH3 or acyl
R11 = H or [structure not reproduced; residual labels: OR10, OR12]
R12 = H or acyl
R13 = H or CH3
R15 = [structure not reproduced; residual labels: R16, NMe2, OR14]
R16 = H or OH

R14 = H or -C(O)NRcRd wherein each of Rc and Rd is independently H, C1-C10 alkyl, C2-C20
alkenyl, C2-C10 alkynyl, -(CH2)m(C6-C10 aryl), or -(CH2)m(5-10 membered heteroaryl),
wherein m is an integer ranging from 0 to 4, and wherein each of the foregoing Rc and Rd
groups, except H, may be substituted by 1 to 3 Q groups; or wherein Rc and Rd may be taken
together to form a 4-7 membered saturated ring or a 5-10 membered heteroaryl ring, wherein
said saturated and heteroaryl rings may include 1 or 2 heteroatoms selected from O, S and N,
in addition to the nitrogen to which Rc and Rd are attached, and said saturated ring may
include 1 or 2 carbon-carbon double or triple bonds, and said saturated and heteroaryl rings
may be substituted by 1 to 3 Q groups; or R2 and R17 taken together form a carbonate ring;
each Q is independently selected from halo, cyano, nitro, trifluoromethyl, azido, -C(O)Q1,
-OC(O)Q1, -C(O)OQ1, -OC(O)OQ1, -NQ2C(O)Q3, -C(O)NQ2Q3, -NQ2Q3, hydroxy, C1-C6
alkyl, C1-C6 alkoxy, -(CH2)m(C6-C10 aryl), and -(CH2)m(5-10 membered heteroaryl), wherein
m is an integer ranging from 0 to 4, and wherein said aryl and heteroaryl substituents may be
substituted by 1 or 2 substituents independently selected from halo, cyano, nitro,
trifluoromethyl, azido, -C(O)Q1, -C(O)OQ1, -OC(O)OQ1, -NQ2C(O)Q3, -C(O)NQ2Q3,
-NQ2Q3, hydroxy, C1-C6 alkyl, and C1-C6 alkoxy;

each Q1, Q2 and Q3 is independently selected from H, OH, C1-C10 alkyl, C1-C6 alkoxy,
C2-C10 alkenyl, C2-C10 alkynyl, -(CH2)m(C6-C10 aryl), and -(CH2)m(5-10 membered
heteroaryl), wherein m is an integer ranging from 0 to 4; with the proviso that the compound
is not 5-O-dedesosaminyl-5-O-mycaminosyl erythromycin A or D.

The present invention also provides compounds according to formula I above in which:

i) the substituent R1 is selected from

- an alpha-branched C3-C8 group selected from alkyl, alkenyl, alkynyl,
alkoxyalkyl and alkylthioalkyl groups any of which may be optionally
substituted by one or more hydroxyl groups;

- a C5-C8 cycloalkylalkyl group wherein the alkyl group is an alpha-branched
C2-C5 alkyl group

- a C3-C8 cycloalkyl group or C5-C8 cycloalkenyl group, either of which may
optionally be substituted by one or more hydroxyl, or one or more C1-C4
alkyl groups or halo atoms

- a 3 to 6 membered oxygen or sulphur containing heterocyclic ring which may
be saturated, or fully or partially unsaturated and which may optionally be
substituted by one or more C1-C4 alkyl groups, halo atoms or hydroxyl groups.

- phenyl which may be optionally substituted with at least one substituent
selected from C1-C4 alkyl, C1-C4 alkoxy and C1-C4 alkylthio groups, halogen
atoms, trifluoromethyl, and cyano or

- R1 is R17-CH2- where R17 is H, C1-C8 alkyl, C2-C8 alkenyl, C2-C8 alkynyl,
alkoxyalkyl or alkylthioalkyl containing from 1 to 6 carbon atoms in each
alkyl or alkoxy group wherein any of said alkyl, alkoxy, alkenyl or alkynyl
groups may be substituted by one or more hydroxyl groups or by one or more
halo atoms; or a C3-C8 cycloalkyl or C5-C8 cycloalkenyl either of which may
be optionally substituted by one or more C1-C4 alkyl groups or halo atoms; or
a 3 to 6 membered oxygen or sulphur containing heterocyclic ring which may
be saturated or fully or partially unsaturated and which may optionally be
substituted by one or more C1-C4 alkyl groups or halo atoms; or a group of
the formula SA16 wherein A16 is C1-C8 alkyl, C2-C8 alkenyl, C2-C8 alkynyl,
C3-C8 cycloalkyl, C5-C8 cycloalkenyl, phenyl or substituted phenyl wherein
the substituent is C1-C4 alkyl, C1-C4 alkoxy or halo, or a 3 to 6 membered
oxygen or sulphur-containing heterocyclic ring which may be saturated, or
fully or partially unsaturated and which may optionally be substituted by one
or more C1-C4 alkyl groups or halo atoms

ii) the substituents R2, R4, R5, R6, R7 and R9 are each, independently, H, OH, CH3,
C2H5, OCH3

iii) the -CHOH- at C11 (erythromycins) or C12 (azithromycins) is replaced by a
methylene group (-CH2-), a keto group (C=O), or by a 10,11-olefinic bond
(erythromycins) or 11,12-olefinic bond (azithromycins)

iv) R8 includes but is not limited to rhamnose, 2'-O-methyl rhamnose, 2',3'-bis-O-
methyl rhamnose, 2',3',4'-tri-O-methyl rhamnose, oleandrose, oliose, digitoxose
or olivose

v) the substituent R11 is H or mycarose or C4-O-acyl-mycarose or glucose

The present invention also provides compounds according to formula I above which
differ in the oxidation state of one or more of the ketide units (i.e. selection of alternatives
from the group: -CO-, -CH(OH)-, alkene -CH-, and CH2) where the stereochemistry of any
-CH(OH)- is also independently selectable.

Novel 5-O-dedesosaminyl-5-O-angolosaminyl erythromycins and azithromycins
made available by this aspect of the invention include, but are not limited to, those where in
the R15 group R14 = R16 = H, with the proviso that they are not angolamycin or medermycin
(Kinumaki and Suzuki, 1972; Ichinose et al., 2003).

Additionally, a person of skill in the art will appreciate that, using the methods of the
present invention, mycaminose and angolosamine may be added to other aglycones or
pseudoaglycones, for example (but without limitation) tylactone or spinosyn pseudoaglycone.
These other aglycones or pseudoaglycones may be the naturally occurring structure or they
may be modified in the aglycone backbone. Such modified substrates may be produced by
chemical semi-synthetic methods (Kaneko et al., 2000 and references cited therein) or,
alternatively, via PKS engineering; such methods are well known in the art (for example WO
93/13663, WO 98/01571, WO 98/01546, WO 98/49315, Kato, Y. et al., 2002).



Moreover, the process of the host cell selection further comprises the optional step of
deleting or inactivating or adding or manipulating genes in the host cell. This process
comprises the improvement of recombinant host strains for the preparation and isolation of
compounds of the invention, in particular 5-O-dedesosaminyl-5-O-mycaminosyl
erythromycins and 5-O-dedesosaminyl-5-O-mycaminosyl azithromycins, specifically
5-O-dedesosaminyl-5-O-mycaminosyl erythromycin A, 5-O-dedesosaminyl-5-O-mycaminosyl
erythromycin C, 5-O-dedesosaminyl-5-O-mycaminosyl erythromycin B,
5-O-dedesosaminyl-5-O-mycaminosyl erythromycin D and 5-O-dedesosaminyl-5-O-mycaminosyl
azithromycin. This approach is exemplified in Example 1 by introducing an eryBVI mutation
into the chromosome of S. erythraea SGQ2 in order to optimise the conversion of the
substrate 3-O-mycarosyl erythronolide B to 5-O-dedesosaminyl-5-O-mycaminosyl
erythromycins.

In a further aspect the invention relates to the construction of gene cassettes. The
cloning method used to isolate these gene cassettes is analogous to that used in
PCT/GB03/003230 and diverges significantly from the approach previously described (WO
01/79520) by assembling the gene cassette directly in an expression vector rather than
pre-assembling the genes in pUC18/19 plasmids, thus providing a more rapid cloning procedure
for the isolation of gene cassettes. The strategy for isolating these gene cassettes is
exemplified in Example 1 to Example 11. A schematic overview of the strategy is given in
Figure 2.

Another aspect of the invention allows the enhancement of gene expression by
changing the order of genes in a gene cassette, the genes including but not limited to tylMI,
tylMIII, tylB, eryCVI, tylAI, tylAII, eryCIII, eryBV, angAI, angAII, angMIII, angB, angMI,
angorf14, angorf4, eryBVI, eryK, eryG, angMII, tylMII, desVII, midI, spnO, spnN, spnP and
genes with similar functions, allowing the arrangement of the genes in a multitude of
permutations (Figure 2).
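
As a rough illustration of how quickly the number of possible gene orders grows, the following Python sketch enumerates the linear orderings of a six-gene cassette. It is not part of the described method: the helper name gene_orders is hypothetical, the six genes are simply taken from the more preferred mycaminose cassette named earlier, and the sketch says nothing about which order gives the best expression, which remains an experimental question.

```python
from itertools import permutations
from math import factorial


def gene_orders(genes):
    """Return every linear ordering of the given genes (hypothetical helper)."""
    return list(permutations(genes))


# Example cassette; gene names are taken from the text, the order is arbitrary.
cassette = ["angAI", "angAII", "angorf14", "angMIII", "angB", "angMI"]

orders = gene_orders(cassette)

# For n distinct genes there are n! orderings (6! = 720 here).
print(len(orders), "possible gene orders for", len(cassette), "genes")
print("first ordering:", " - ".join(orders[0]))
```

Even a modest cassette therefore offers far more arrangements than could be screened exhaustively, so in practice only selected orders would be built and compared.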

The cloning strategy outlined in this invention also allows the introduction of a
histidine tag in combination with a terminator sequence 3' of the gene cassette to enhance
gene expression (see Example 1). Those skilled in the art will appreciate that other terminator
sequences well known in the art could be used. See, for example, Bussiere and Bastia (1999),
Bertram et al. (2001) and Kieser et al. (2000), incorporated herein by reference.

Another aspect of the invention comprises the use of alternative promoters such as
ptipA (Ali et al., 2002) and/or ptr (Salah-Bey et al., 1995) to express genes and/or assembled
gene cassette(s) to enhance expression.

Another aspect of the invention describes the multiple uses of promoter sequences in
the assembled gene cassette to enhance gene expression as exemplified in Example 6.

Another aspect of the invention describes the addition of genes encoding an
NDP-glucose synthase such as tylAI and an NDP-glucose-4,6-dehydratase such as tylAII to the gene
cassette in order to enhance the endogenous production of the activated sugar substrate. Those
skilled in the art will appreciate that alternative sources of equivalent sugar biosynthetic
pathway genes may be used. In this context alternative sources include but are not limited to:

TylAI homologues: DesIII of Streptomyces venezuelae (accession no AAC68682),
GrsD of Streptomyces griseus (accession no AAD31799), AveBIII of Streptomyces
avermitilis (accession no BAA84594), Gtt of Saccharopolyspora spinosa (accession
no AAK83289), SnogJ of Streptomyces nogalater (accession no AAF01820), AclY of
Streptomyces galilaeus (accession no BAB72036), LanG of Streptomyces cyanogenus
(accession no AAD13545), Graorf16 (GraD) of Streptomyces violaceoruber
(accession no AAA99940), OleS of Streptomyces antibioticus (accession no
AAD55453) and StrD of Streptomyces griseus (accession no A26984) and AngAI of
S. eurythermus.

TylAII homologues: AprE of Streptomyces tenebrarius (accession no AAG18457),
GdH of S. spinosa (accession no AAK83290), DesIV of S. venezuelae (accession no
AAC68681), GdH of S. erythraea (accession no AAA68211), AveBII of S.
avermitilis (accession no BAA84593), ScfSl .08C of Streptomyces coelicolor
(accession no CAB61555), LanH of S. cyanogenus (accession no AAD13546),
Graorf17 (GraE) of S. violaceoruber (accession no S58686), OleE of S. antibioticus
(accession no AAD55454), StrE of S. griseus (accession no P29782) and AngAII of
S. eurythermus.

Similarly, alternative sources for activated sugar biosynthesis gene homologues to
tylMIII, angAIII, eryCII, tylMI, angMII, tylB, angB, eryCI, tylMI, angMI, eryCVI, tylIa,
angorf14, angorf4, spnO, eryBVI, eryBV, eryCIII, desVII, midI, spnN and spnP will readily
occur to those skilled in the art, and can be used.

Another aspect of the invention describes the use of alternative glycosyltransferases
in the gene cassettes such as EryCIII. Those skilled in the art will appreciate that alternative
glycosyltransferases may be used. In this context alternative glycosyltransferases include but
are not limited to: TylMII (Accession no CAA57472), DesVII (Accession no AAC68677),
MegCIII (Accession no AAG13921), MegDI (Accession no AAG13908) or AngMII of S.
eurythermus.

In one aspect of the present invention, the gene cassette may additionally comprise a
chimeric glycosyltransferase (GT). This is particularly of benefit where the natural GT does
not recognise the combination of sugar and aglycone that is required for the synthesis of the
desired analogue. Therefore, in this aspect the present invention specifically contemplates the
use of a chimeric GT wherein part of the GT is specific for the recognition of the sugar
whose synthesis is directed by the genes in said expression cassette when expressed in an
appropriate strain background and part of the GT is specific for the aglycone or
pseudoaglycone template (Hu and Walker, 2002).

Those skilled in the art will appreciate that different strategies may be used for the
introduction of gene cassettes into the host strain, such as site-specific integration vectors
(Smokvina et al., 1990; Lee et al., 1991; Matsuura et al., 1996; Van Mellaert et al., 1998;
Kieser et al., 2000). Alternatively, plasmids containing the gene cassettes may be integrated
into any neutral site on the chromosome using homologous recombination sites. Further, for
a number of actinomycete host strains, including S. erythraea, the gene cassettes may be
introduced on self-replicating plasmids (Kieser et al., 2000; WO 98/01571).

A further aspect of the invention provides a process for the production of compounds 
of the invention and optionally for the isolation of said compounds. 

A further aspect of the invention is the use of different fermentation methods to
optimise the production of the compounds of the invention, as exemplified in Example 1.
Another aspect of the invention is the addition of ery genes such as eryK and/or eryG to the
gene cassette. One skilled in the art will appreciate that the process can be optimised for the
production of a specific erythromycin (i.e. A, B, C, D) or azithromycin by manipulation of the
genes eryG (responsible for the methylation on the mycarose sugar) and/or eryK (responsible
for hydroxylation at C-12). Thus, to optimise the production of the A-form, an extra copy of
eryK may be included in the gene cassette. Conversely, if the erythromycin B analogue is
required, this can be achieved by deletion of the eryK gene from the S. erythraea host strain,
or by working in a heterologous host in which the gene and/or its functional homologue is
not present. Similarly, if the erythromycin D analogue is required, this can be achieved by
deletion of both eryG and eryK genes from the S. erythraea host strain, or by working in a
heterologous host in which both genes and/or their functional homologues are not present.
Similarly, if the erythromycin C analogue is required, this can be achieved by deletion of the
eryG gene from the S. erythraea host strain, or by working in a heterologous host in which the
gene and/or its functional homologues are not present.
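
The relationship between the eryK/eryG background and the erythromycin form obtained can be summarised as a small lookup. The following Python sketch is illustrative only; the function name erythromycin_form and its boolean arguments are hypothetical and simply restate the four cases described above.

```python
def erythromycin_form(ery_k: bool, ery_g: bool) -> str:
    """Return the erythromycin form expected for a given eryK/eryG background.

    ery_k: True if a functional eryK (C-12 hydroxylation) is present.
    ery_g: True if a functional eryG (O-methylation of mycarose) is present.
    """
    forms = {
        (True, True): "A",    # hydroxylated at C-12 and O-methylated
        (False, True): "B",   # eryK absent: no C-12 hydroxylation
        (True, False): "C",   # eryG absent: no O-methylation
        (False, False): "D",  # both absent: neither modification
    }
    return forms[(ery_k, ery_g)]


if __name__ == "__main__":
    for k in (True, False):
        for g in (True, False):
            print(f"eryK={k}, eryG={g} -> erythromycin {erythromycin_form(k, g)}")
```

The sketch treats each gene as simply present or absent; it does not model copy number, so the effect of an extra copy of eryK on the yield of the A-form is outside its scope.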

In this context a preferred host cell strain is a mammalian cell strain, a fungal cell
strain or a prokaryote. More preferably the host cell strain is Pseudomonas, myxobacteria or
E. coli. In a more preferred embodiment the host cell strain is an actinomycete, still more
preferably including, but not limited to, Saccharopolyspora erythraea, Streptomyces
coelicolor, Streptomyces avermitilis, Streptomyces griseofuscus, Streptomyces
cinnamonensis, Streptomyces fradiae, Streptomyces eurythermus, Streptomyces
longisporoflavus, Streptomyces hygroscopicus, Saccharopolyspora spinosa, Micromonospora
griseorubida, Streptomyces lasaliensis, Streptomyces venezuelae, Streptomyces antibioticus,
Streptomyces lividans, Streptomyces rimosus, Streptomyces albus, Amycolatopsis
mediterranei, Nocardia sp., Streptomyces tsukubaensis and Actinoplanes sp. N902-109. In a
still more preferred embodiment the host cell strain is selected from Saccharopolyspora
erythraea, Streptomyces griseofuscus, Streptomyces cinnamonensis, Streptomyces albus,
Streptomyces lividans, Streptomyces hygroscopicus sp., Streptomyces hygroscopicus var.
ascomyceticus, Streptomyces longisporoflavus, Saccharopolyspora spinosa, Streptomyces
tsukubaensis, Streptomyces coelicolor, Streptomyces fradiae, Streptomyces rimosus,
Streptomyces avermitilis, Streptomyces eurythermus, Streptomyces venezuelae and Amycolatopsis
mediterranei. In the most highly preferred embodiment the host strain is Saccharopolyspora
erythraea.

The present invention provides methods for the production and isolation of
compounds of the invention, in particular of erythromycin and azithromycin analogues which
differ from the natural compound in the glycosylation of the C-5 position, for example but
without limitation: novel 5-O-dedesosaminyl-5-O-mycaminosyl or angolosaminyl
erythromycins and 5-O-dedesosaminyl-5-O-mycaminosyl or angolosaminyl azithromycins,
which are useful as anti-microbial agents for use in human or animal health.

In further aspects the present invention provides novel products as obtainable by any
of the processes disclosed herein.

Brief description of Figures 

Figure 1A: Structures of 5-O-dedesosaminyl-5-O-mycaminosyl erythromycin A,
5-O-dedesosaminyl-5-O-mycaminosyl erythromycin B and 5-O-dedesosaminyl-5-O-mycaminosyl
erythromycin C.

Figure 1B: Structure of 5-O-dedesosaminyl-5-O-mycaminosyl azithromycin.

Figure 2: Schematic overview of the gene cassette cloning strategy. Vector pSG144
was derived from vector pSG142 (Gaisser et al., 2000). Abbreviations: dam⁻: DNA isolated
from a dam⁻ strain background; XbaImet: XbaI site sensitive to Dam methylation; eryRHS: DNA
fragment of the right hand side of the ery-cluster as described previously (Gaisser et al.,
2000).

Figure 3: Amino acid comparison between the published sequence of TylAI (below) and the
amino acid sequence detected from the sequencing data described in this invention (above).
The changes in the amino acid sequence are underlined.

Figure 4: Amino acid comparison between the published sequence of TylAII (below) and the
amino acid sequence detected from the sequencing data described in this invention (above).
The changes in the amino acid sequence are underlined.

Figure 5: Structure of 5-O-angolosaminyl tylactone.

Figure 6: Shows an overview of the angolamycin polyketide synthase gene cluster.

Figure 7: The DNA sequence which comprises orf14 and orf15 (angB) from the
angolamycin gene cluster.

Figure 8: The DNA sequence which comprises orf2 (angAI), orf3 (angAII) and orf4
from the angolamycin gene cluster.

Figure 9: The DNA sequence which comprises orf1* (angMIII), orf2* (angMII) and
orf3* (angMI) from the angolamycin gene cluster.

Figure 10: The amino acid sequence which corresponds to orf2 (angAI).

Figure 11: The amino acid sequence which corresponds to orf3 (angAII).

Figure 12: The amino acid sequence which corresponds to orf4.

Figure 13: The amino acid sequence which corresponds to orf14.

Figure 14: The amino acid sequence which corresponds to orf15 (angB).

Figure 15: The amino acid sequence which corresponds to orf1* (angMIII).

Figure 16: The amino acid sequence which corresponds to orf2* (angMII).

Figure 17: The amino acid sequence which corresponds to orf3* (angMI).

General Methods

Escherichia coli XL1-Blue MR (Stratagene), E. coli DH10B (GibcoBRL) and E. coli
ET12567 were grown in 2xTY medium as described by Sambrook et al. (1989). Vectors
pUC18, pUC19 and Litmus 28 were obtained from New England Biolabs. E. coli
transformants were selected with 100 µg/ml ampicillin. Conditions used for growing the
Saccharopolyspora erythraea NRRL 2338-red variant strain were as described previously
(Gaisser et al., 1997; Gaisser et al., 1998). Expression vectors in S. erythraea were derived
from plasmid pSG142 (Gaisser et al., 2000). Plasmid-containing S. erythraea were selected
with 25-40 µg/ml thiostrepton or 50 µg/ml apramycin. To investigate the production of
antibiotics, S. erythraea strains were grown in sucrose-succinate medium (Caffrey et al.,
1992) as described previously (Gaisser et al., 1997) and the cells were harvested by
centrifugation. Chromosomal DNA of Streptomyces rochei ATCC21250 was isolated using
standard procedures (Kieser et al., 2000). Feedings of 3-O-mycarosyl erythronolide B or
tylactone were carried out at concentrations between 25 and 50 mg/l.

DNA manipulation and sequencing 

DNA manipulations, PCR and electroporation procedures were carried out as
described in Sambrook et al. (1989). Protoplast formation and transformation procedures of
S. erythraea were as described previously (Gaisser et al., 1997). Southern hybridizations were
carried out with probes labelled with digoxigenin using the DIG DNA labelling kit
(Boehringer Mannheim). DNA sequencing was performed as described previously (Gaisser et
al., 1997), using automated DNA sequencing on double stranded DNA templates with an ABI
Prism 3700 DNA Analyzer. Sequence data were analysed using standard programs.

Extraction and mass spectrometry 

1 ml of each fermentation broth was harvested and the pH was adjusted to pH 9. For
extractions an equal volume of ethyl acetate, methanol or acetonitrile was added, mixed for at
least 30 min and centrifuged. For extractions with ethyl acetate, the organic layer was
evaporated to dryness and then re-dissolved in 0.5 ml methanol. For methanol and acetonitrile
extractions, supernatant was collected after centrifugation and used for analysis. High
resolution spectra were obtained on a Bruker BioApex II FT-ICR (Bruker, Bremen, FRG).

Analysis of culture broths

An aliquot of whole broth (1 ml) was shaken with CH3CN (1 ml) for 30 minutes. The mixture
was clarified by centrifugation and the supernatant analysed by LCMS. The HPLC system
comprised an Agilent HP1100 equipped with a Luna 5 µm C18 BDS 4.6 x 250 mm column
(Phenomenex, Macclesfield, UK) heated to 40°C. The gradient elution was from 25% mobile
phase B to 75% mobile phase B over 19 minutes at a flow rate of 1 ml/min. Mobile phase A
was 10% acetonitrile: 90% water, containing 10 mM ammonium acetate and 0.15% formic
acid; mobile phase B was 90% acetonitrile: 10% water, containing 10 mM ammonium acetate
and 0.15% formic acid. The HPLC system described was coupled to a Bruker Daltonics
Esquire3000 electrospray mass spectrometer operating in positive ion mode.
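
Because both mobile phases contain acetonitrile, the overall acetonitrile content during the analytical gradient is not the same as the percentage of mobile phase B. The following Python sketch works through the arithmetic for the gradient stated above; the function names are hypothetical, and the gradient is assumed to be strictly linear with no hold or re-equilibration steps.

```python
def percent_b(t_min, start_b=25.0, end_b=75.0, duration=19.0):
    """Percentage of mobile phase B at time t_min for a linear gradient.

    Times outside the gradient window are clamped to its start or end.
    """
    t = min(max(t_min, 0.0), duration)
    return start_b + (end_b - start_b) * t / duration


def percent_acetonitrile(t_min):
    """Overall % acetonitrile, with A = 10% and B = 90% acetonitrile."""
    b = percent_b(t_min) / 100.0
    return 10.0 * (1.0 - b) + 90.0 * b


if __name__ == "__main__":
    for t in (0.0, 9.5, 19.0):
        print(f"t = {t:4.1f} min: {percent_b(t):.1f}% B, "
              f"{percent_acetonitrile(t):.1f}% acetonitrile")
```

Under these assumptions the run spans roughly 30% to 70% acetonitrile overall, which is the same range as the binary CH3CN/H2O gradient used for preparative purification of 5-O-angolosaminyl tylactone.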

Extraction and purification protocol: 

For NMR analysis of 5-O-dedesosaminyl-5-O-mycaminosyl erythromycin A the
fermentation broth was clarified by centrifugation to provide supernatant and cells. The
supernatant was applied to a column (16 x 15 cm) of Diaion® HP20 resin (Supelco), washed
with 10% Me2CO/H2O (2 x 2 l) and then eluted with Me2CO (3.5 l). The cells were mixed to
homogeneity with an equal volume of Me2CO/MeOH (1:1). After at least 30 minutes the
slurry was clarified by centrifugation and the supernatant decanted. The pelleted cells were
similarly extracted once more with Me2CO/MeOH (1:1). The cell extracts were combined
with the Me2CO from the HP20 column and the solvent was removed in vacuo to give an
aqueous concentrate. The aqueous concentrate was extracted with EtOAc (3 x) and the solvent
removed in vacuo to give a crude extract. The residue was dissolved in CH3CN/MeOH and
purified by repeated rounds of reverse phase (C18) high performance liquid chromatography
using a Gilson HPLC, eluting a Phenomenex 21.2 x 250 mm Luna 5 µm C18 BDS column at 21
ml/min. Elution with a linear gradient of 32.5% B to 63% B was used to concentrate the
macrolides followed by isocratic elution with 30% B to resolve the individual erythromycins.
Mobile phase A was 20 mM ammonium acetate and mobile phase B was acetonitrile.
High resolution mass spectra were acquired on a Bruker BioApex II FTICR (Bruker, Bremen,
Germany).

For NMR analysis of 5-O-angolosaminyl tylactone, bioconversion experiments were
performed as previously described with four 2 l flasks each containing 400 ml of SSDM
medium inoculated with 5% of pre-cultures. Feedings with tylactone were carried out at 50
mg/l. The culture was centrifuged and the pH of the supernatant was adjusted to about pH 9
followed by extractions with three equal volumes of ethyl acetate. The cell pellet was
extracted twice with equal volumes of a mixture of acetone-methanol (50:50, vol/vol). The
extracts were combined and concentrated in vacuo. The resulting aqueous fraction was
extracted three times with ethyl acetate and the extracts were combined and evaporated to
dryness. This semi-purified extract was dissolved in methanol and purified by preparative
HPLC on a Gilson 315 system using a 21 mm x 250 mm Prodigy ODS3 column
(Phenomenex, Macclesfield, UK). The mobile phase was pumped at a flow rate of 21 ml/min
as a binary system consisting of 30% CH3CN, 70% H2O increasing linearly to 70% CH3CN
over 20 min.

Sequence Information 



Table 1: Sequence information for the angolosamine biosynthetic genes included in the
cassettes

Gene (named according    Bases in Figure               Corresponding polypeptide
to tyl equivalent)                                     (Figure number; function)
-----------------------  ----------------------------  ------------------------------------------
orf2 (angAI)             14847-15731c from Figure 8    Figure 10; NDP-hexose synthase
orf3 (angAII)            13779-14774c from Figure 8    Figure 11; NDP-hexose 4,6-dehydratase
orf4 (N-part, C-part)    [illegible] from Figure 8     Figure 12; type II thioesterase (N-part),
                                                       NDP-hexose 2,3-dehydratase (C-part)
orf14                    1162-2160c from Figure 7      Figure 13; NDP-hexose 4-ketoreductase
orf15 (angB)             33-1151c from Figure 7        Figure 14; NDP-hexose aminotransferase
orf1* (angMIII)          59800-61140 from Figure 9     Figure 15; hypothetical NDP-hexose
                                                       3,4-isomerase
orf2* (angMII)           61159-62430 from Figure 9     Figure 16; angolosaminyl glycosyl
                                                       transferase
orf3* (angMI)            62452-63171 from Figure 9     Figure 17; N,N-dimethyl transferase

Note: c indicates that the gene is encoded by the complement DNA strand. Potential
functions of the predicted polypeptides (SEQ ID No.7 to 34) were obtained
from the NCBI database using a BLAST search.
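
The coordinate notation used in the table (start and end base, with a trailing c for a gene on the complement strand) can be read programmatically. The following Python sketch is illustrative; parse_coords is a hypothetical helper, and the coordinate strings are the legible entries of the table with OCR spacing removed. It also checks that each gene length is a whole number of codons.

```python
import re


def parse_coords(text):
    """Parse 'start-end' with an optional trailing 'c' (complement strand)."""
    m = re.fullmatch(r"(\d+)-(\d+)(c?)", text.strip())
    if m is None:
        raise ValueError(f"unrecognised coordinates: {text!r}")
    start, end = int(m.group(1)), int(m.group(2))
    return {
        "start": start,
        "end": end,
        "strand": "-" if m.group(3) else "+",  # 'c' marks the complement strand
        "length": end - start + 1,             # inclusive base count
    }


# Legible coordinate entries from the table (orf4 is omitted: its bases are illegible).
GENES = {
    "orf2 (angAI)": "14847-15731c",
    "orf3 (angAII)": "13779-14774c",
    "orf14": "1162-2160c",
    "orf15 (angB)": "33-1151c",
    "orf1* (angMIII)": "59800-61140",
    "orf2* (angMII)": "61159-62430",
    "orf3* (angMI)": "62452-63171",
}

if __name__ == "__main__":
    for name, coords in GENES.items():
        info = parse_coords(coords)
        whole_codons = info["length"] % 3 == 0
        print(f"{name:16s} strand {info['strand']} "
              f"{info['length']:5d} bp whole codons: {whole_codons}")
```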

Example 1: Bioconversion of 3-O-mycarosyl erythronolide B to 5-O-
dedesosaminyl-5-O-mycaminosyl erythromycins using gene cassette
pSG144tylAItylAIItylMIIItylBtylIatylMIeryCIII.



Isolation of pSG143 

Plasmid pSG142 (Gaisser et al., 2000) was digested with XbaI and a fill-in reaction
was performed using standard protocols. The DNA was re-ligated and used to transform E.
coli DH10B. Construct pSG143 was isolated and the removal of the XbaI site was confirmed
by sequence analysis.

Isolation of pUC18eryBVcas

The gene eryBV was amplified by PCR using the primers cas01eG21 (WO 01/79520)
and 7966 5'-GGGGAATTCAGATCTGGTCTAGAGGTCAGCCGGCGTGGCGGCGCGTG
AGTTCCTCCAGTCGCGGGACGATCT-3' and pSG142 (Gaisser et al., 2000) as template.
The PCR fragment was cloned using standard procedures and plasmid pUC18eryBVcas was
isolated with an NdeI site overlapping the start codon of eryBV and XbaI and BglII sites
(underlined) following the stop codon. The construct was verified by sequence analysis.
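
The restriction sites engineered into a primer can be checked by a simple motif search. The following Python sketch is illustrative; find_sites is a hypothetical helper, the recognition sequences are the standard ones for NdeI, XbaI and BglII, and primer 7966 is entered with its BglII site read as AGATCT.

```python
# Standard recognition sequences for the enzymes named in the text.
SITES = {"NdeI": "CATATG", "XbaI": "TCTAGA", "BglII": "AGATCT"}


def find_sites(seq, sites=SITES):
    """Return {enzyme: [0-based start positions]} for each site found in seq."""
    seq = seq.upper().replace(" ", "")
    hits = {}
    for name, motif in sites.items():
        positions = []
        i = seq.find(motif)
        while i != -1:
            positions.append(i)
            i = seq.find(motif, i + 1)
        if positions:
            hits[name] = positions
    return hits


# Primer 7966 (reverse primer for eryBV), written 5' to 3'.
PRIMER_7966 = (
    "GGGGAATTCAGATCTGGTCTAGAGGTCAGCCGGCGTGGCGGCGCGTG"
    "AGTTCCTCCAGTCGCGGGACGATCT"
)

if __name__ == "__main__":
    print(find_sites(PRIMER_7966))
```

All three sites are palindromic, so searching one strand is sufficient. The primer anneals to the 3' end of eryBV, which is why it carries the XbaI and BglII sites but no NdeI site.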

Isolation of vector pSGLit1

The isolation of this vector is described in PCT/GB03/003230.

Isolation of pSGLit1eryCIII

Plasmid pSGCIII (WO 01/79520) was digested with NdeI/BglII and the insert fragment was
isolated and ligated with the NdeI/BglII treated vector fragment of pSGLit1. The ligation was
used to transform E. coli ET12567 and plasmid pSGLit1eryCIII was isolated using standard
procedures. The construct was confirmed using restriction digests and sequence analysis. This
cloning strategy allows the introduction of a his-tag C-terminal of EryCIII.

Isolation of pSGLit1tylMII

Plasmid pSGTYLM2 (WO 01/79520) was digested with NdeI/BglII and the insert fragment was
isolated and ligated with the NdeI/BglII treated vector fragment of pSGLit1. The ligation was
used to transform E. coli ET12567 and plasmid pSGLit1tylMII was isolated using standard
procedures. The construct was confirmed using restriction digests and sequence analysis. This
cloning strategy allows the introduction of a his-tag C-terminal of TylMII.

Isolation of pSG144

Plasmid pSGLit1 was isolated and digested with NdeI/BglII and an approximately 1.3
kb insert was isolated. Plasmid pSG143 was digested with NdeI/BglII, the vector band was
isolated and ligated with the approximately 1.3 kb band from pSGLit1 followed by
transformation of E. coli DH10B. Plasmid pSG144 (Figure 2) was isolated and the construct
was verified by DNA sequence analysis. This vector allows the assembly of gene cassettes
directly in an expression vector (Figure 2) without prior assembly in pUC-derived vectors
(WO 01/79520) in analogy to PCT/GB03/003230 using vector pSG144 instead of pSGset1.
Plasmid pSG144 differs from pSG142 in that the XbaI site between the thiostrepton resistance
gene and the eryRHS has been deleted and the his-tag at the end of eryBV has been removed
from pSG142 and replaced in pSG144 with an XbaI site at the end of eryBV. This is to
facilitate direct cloning of genes to replace eryBV and then build up the cassette.

Isolation of pSG144eryCIII

eryCIII was amplified by PCR using standard protocols, with primers
cas01eG21 (WO 01/79520) and caseryCIII2 (WO 01/79520) and plasmid pSGCIII (Gaisser et
al., 2000) as template. The approximately 1.3 kb PCR product was isolated and cloned into
pUC18 using standard techniques. Plasmid pUCCIIIcass was isolated and the sequence was
verified. The insert fragment of plasmid pUCCIIIcass was isolated after NdeI/XbaI digestion
and ligated with the NdeI/XbaI digested vector fragment of pSG144. After the transformation
of E. coli DH10B plasmid pSG144eryCIII was isolated using standard techniques.

Isolation of pUC19tylAI

Primers BIOSG34 5'-GGGCATATGAACGACCGTCCCCGCCGCGCCATGAAGGG-3' and
5'-CCCCTCTAGAGGTCACTGTGCCCGGCTGTCGGCGGCGGCCCCGCGCATGG-3' were
used with genomic DNA of Streptomyces fradiae as template to amplify tylAI. The amplified
product was cloned using standard protocols and plasmid pUC19tylAI was isolated. The insert
was verified by DNA sequence analysis. Differences to the published sequence are shown in
Figure 3.

Isolation of pSGLit2 

Plasmid Litmus 28 was digested with SpeI/XbaI and the vector fragment was isolated.
Plasmid pSGLit1 (dam⁻) was digested with XbaI and the insert band was isolated and ligated
with the SpeI/XbaI digested vector fragment of Litmus 28 followed by the transformation of
E. coli DH10B using standard techniques. Plasmid pSGLit2 was isolated and the construct
was verified by restriction digest and sequence analysis. This plasmid can be used to add a 5'
region containing an XbaI site sensitive to Dam methylation and a Shine-Dalgarno region, thus
converting genes which were originally cloned with an NdeI site overlapping the start codon
and an XbaI site 3' of the stop codon for the assembly of gene cassettes. This conversion
includes the transformation of the ligations into E. coli ET12567 followed by the isolation of
dam⁻ DNA and XbaI digests. Examples of this strategy are outlined below.
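
An XbaI site (TCTAGA) is sensitive to Dam methylation when it overlaps a GATC motif, because Dam methylates the adenine within GATC and XbaI does not cut the methylated site; this is why DNA must be prepared from a dam⁻ host such as E. coli ET12567 before the XbaI digest. The following Python sketch shows one way to flag such sites. The function name is hypothetical, and the first test sequence is the 5' region of the forward tylAII primer given in this Example.

```python
XBAI = "TCTAGA"
DAM = "GATC"


def dam_sensitive_xbai_sites(seq):
    """Return 0-based starts of XbaI sites that overlap a GATC (Dam) motif.

    Only two overlaps are possible for TCTAGA: GATC immediately upstream
    (GATCTAGA) or immediately downstream (TCTAGATC).
    """
    seq = seq.upper().replace(" ", "")
    hits = []
    i = seq.find(XBAI)
    while i != -1:
        upstream = seq[max(i - 2, 0):i + 2] == DAM   # GA + TC of the site
        downstream = seq[i + 4:i + 8] == DAM         # GA of the site + TC
        if upstream or downstream:
            hits.append(i)
        i = seq.find(XBAI, i + 1)
    return hits


if __name__ == "__main__":
    # 5' region of the forward tylAII primer: XbaI site followed by TC.
    print(dam_sensitive_xbai_sites("GGGTCTAGATCGATTAATTAAGGAGG"))
    # 5' region of the reverse tylAI primer: XbaI site with no overlapping GATC.
    print(dam_sensitive_xbai_sites("CCCCTCTAGAGGTCACTG"))
```

In this scheme the upstream XbaI site of each cassette fragment is Dam-sensitive and the downstream one is not, which would allow the two ends to be distinguished by the methylation state of the DNA preparation.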

Isolation of pSGLit2tylAI

Plasmid pSGLit2 and pUC19tylAI were digested with NdeI/XbaI and the insert band
of pUC19tylAI and the vector band of pSGLit2 were isolated, ligated and used to transform E.
coli ET12567. Plasmid pSGLit2tylAI (dam⁻) was isolated.

Isolation of pUC19tylAII

Primers 5'-CCCCTCTAGAGGTCATGCGCGCTCCAGTTCCCTGCCGCCCGGGGACCGCTTG-3'
and
5'-GGGTCTAGATCGATTAATTAAGGAGGACATTCATGCGCGTCCTGGTGACCGGAGGTGCGGGCTTCATCGGCTCGCACTTCA-3'
and genomic DNA of Streptomyces fradiae as
template were used for a PCR reaction applying standard protocols to amplify tylAII. The
approximately 1 kb sized DNA fragment was isolated and cloned into SmaI-cut pUC19 using
standard techniques. The DNA sequencing of this construct revealed that 12 nucleotides at the
5' end had been removed, possibly by an exonuclease activity present in the PCR reaction.
The comparison of the amino acid sequence of the cloned fragment with the published
sequence is shown in Figure 4.

Isolation of pSGLit2tylAII

To add the missing 5'-nucleotides, pSGLit2 was digested with PacI/XbaI and the
vector fragment was isolated and ligated with the PacI/XbaI digested insert fragment of
pUC19tylAII. The ligated DNA was used to transform E. coli ET12567 and plasmid
pSGLit2tylAII (dam⁻) was isolated.

Isolation of plasmid pUC19eryCVI

The eryCVI gene was amplified by PCR using primer BIOSG28 5'-
GGGCATATGTACGAGGGCGGGTTCGCCGAGCTTTACGACC-3' and BIOSG29 5'-
GGGGTCTAGAGGTCATCCGCGCACACCGACGAACAACCCG-3' and plasmid
pNCQ62 (Gaisser et al., 1997) as a template. The PCR product was cloned into SmaI digested
pUC19 using standard techniques and plasmid pUC19eryCVI was isolated and verified by
sequence analysis.

Isolation of plasmid pSGLit2eryCVI

Plasmid pUC19eryCVI was digested with NdeI/XbaI and ligated with the NdeI/XbaI
digested vector fragment of pSGLit2 followed by transformation of E. coli ET12567. Plasmid
pSGLit2eryCVI (dam⁻) was isolated.

Isolation of plasmid pSG144tylAI

Plasmid pSG144 and pUC19tylAI were digested with NdeI/XbaI and the insert band of
pUC19tylAI and the vector band of pSG144 were isolated, ligated and used to transform E.
coli DH10B. Plasmid pSG144tylAI was isolated using standard protocols.

Isolation of plasmid pSG144tylAItylAII

Plasmid pSGLit2tylAII (dam⁻) was digested with XbaI and ligated with XbaI digested plasmid
pSG144tylAI. The ligation was used to transform E. coli DH10B and plasmid
pSG144tylAItylAII was isolated and verified using standard protocols.

Isolation of plasmid pSGLit2tylMIII

Plasmid pUC18tylM3 (isolation described in WO 01/79520) was digested with NdeI/XbaI and
the insert band and the vector band of NdeI/XbaI digested pSGLit2 were isolated, ligated and
used to transform E. coli ET12567. Plasmid pSGLit2tylMIII (dam⁻) was isolated using
standard protocols. The construct was verified using restriction digests and sequence analysis.

Isolation of plasmid pSG144tylAItylAIItylMIII

Plasmid pSGLit2tylMIII (dam⁻) was digested with XbaI and the insert band was ligated with
XbaI digested plasmid pSG144tylAItylAII. The ligation was used to transform E. coli DH10B
and plasmid pSG144tylAItylAIItylMIII no36 was isolated using standard protocols. The
construct was verified using restriction digests and sequence analysis.

Isolation of plasmid pSGLit2tylB

Plasmid pUC18tylB (isolation described in WO 01/79520) was digested with PacI/XbaI and
the insert band and the vector band of PacI/XbaI digested pSGLit2 were isolated, ligated and
used to transform E. coli ET12567. Plasmid pSGLit2tylB no1 (dam⁻) was isolated using
standard protocols.

Isolation of plasmid pSG144tylAItylAIItylMIIItylB

Plasmid pSGLit2tylB (dam⁻) was digested with XbaI and the insert band was ligated with
XbaI digested plasmid pSG144tylAItylAIItylMIII. The ligation was used to transform E. coli
DH10B and plasmid pSG144tylAItylAIItylMIIItylB no5 was isolated using standard protocols
and verified by restriction digests and sequence analysis.

Isolation of plasmid pUC18tylIa 

Primers BIOSG88 5'-GGGCATATGGCGGCGAGCACTACGACGGAGGGGAATGT-3'
and BIOSG89 5'-GGGTCTAGAGGTCACGGGTGGCTCCTGCCGGCCCTCAG-3' were
used to amplify tylIa using a plasmid carrying the tyl region (accession number
u08223.em_pro2) comprising ORF1 (cytochrome P450) to the end of ORF2 (TylB) as a
template. Plasmid pUCtylIa no1 was isolated using standard procedures and the construct was
verified using sequence analysis.

Isolation of plasmid pSGLit2tylIa

Plasmid pUCtylIa no1 was digested with NdeI/XbaI and the insert band and the vector band
of NdeI/XbaI digested pSGLit2 were isolated, ligated and used to transform E. coli ET12567.
Plasmid pSGLit2tylIa no54 (dam⁻) was isolated using standard protocols. The construct was
verified using sequence analysis.

Isolation of plasmid pSG144tylAItylAIItylMIIItylBtylIa

Plasmid pSGLit2tylIa (dam⁻) was digested with XbaI and the insert band was ligated with
XbaI digested plasmid pSG144tylAItylAIItylMIIItylB. The ligation was used to transform E.
coli DH10B and plasmid pSG144tylAItylAIItylMIIItylBtylIa no3 was isolated using standard
protocols and verified by restriction digests and sequence analysis.

Isolation of plasmid pSGLit1tylMIeryCIII

Plasmid pUCtylMI (isolation described in WO 01/79520) was PacI digested and the insert was
ligated with the PacI digested vector fragment of pSGLit1eryCIII using standard procedures.
Plasmid pSGLit1tylMIeryCIII no20 was isolated and the orientation was confirmed by
restriction digests and sequence analysis.

Isolation of gene cassette pSG144tylAItylAIItylMIIItylBtylIatylMIeryCIII

Plasmid pSGLit1tylMIeryCIII no20 was digested with XbaI/BglII and the insert band was
isolated and ligated with the XbaI/BglII digested vector fragment of plasmid
pSG144tylAItylAIItylMIIItylBtylIa no3. Plasmid
pSG144tylAItylAIItylMIIItylBtylIatylMIeryCIII was isolated using standard procedures and
the construct was confirmed using restriction digests and sequence analysis. Plasmid
preparations were used to transform S. erythraea mutant strains with standard procedures.
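
The cassette is built up one fragment at a time, and each intermediate plasmid is named by appending the newly added gene(s) to the name of its parent. The following Python sketch reproduces that naming for the series described above; assemble_cassette is a hypothetical helper and models only the order of insertion, not the cloning itself.

```python
def assemble_cassette(vector, fragments):
    """Return the names of the intermediate plasmids, in order of construction."""
    names = []
    current = vector
    for fragment in fragments:
        current += fragment  # each ligation appends the fragment to the cassette
        names.append(current)
    return names


# Order of insertion in this Example; the last fragment carries two genes
# (tylMI and eryCIII) and is added as a single XbaI/BglII fragment.
FRAGMENTS = ["tylAI", "tylAII", "tylMIII", "tylB", "tylIa", "tylMIeryCIII"]

if __name__ == "__main__":
    for step, name in enumerate(assemble_cassette("pSG144", FRAGMENTS), start=1):
        print(step, name)
```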

Isolation of plasmid pSGKC1

To prevent the conversion of the substrate 3-O-mycarosyl erythronolide B to 3,5-di-O-
mycarosyl erythronolide B a further chromosomal mutation was introduced into S. erythraea
SGQ2 (isolation described in WO 01/79520) to prevent the biosynthesis of L-mycarose in the
strain background. Plasmid pSGKC1 was isolated by cloning the approximately 0.7 kb DNA
fragment of the eryBVI gene by using PCR amplification with cosmid 2 or plasmid pGG1
(WO 01/79520) as a template and with the primers 646 5'-
CATCGTCAAGGAGTTCGACGGT-3' and 874 5'-GCCAGCTCGGCGACGTCCATC-3'
using standard protocols. Cosmid 2 containing the right hand side of the ery-cluster was
isolated from an existing cosmid library (Gaisser et al., 1997) by screening with eryBV as a
probe using standard techniques. The amplified DNA fragment was isolated and cloned into
EcoRV digested pKC1132 (Bierman et al., 1992) using standard methods. The ligated DNA
was used to transform E. coli DH10B and plasmid pSGKC1 was isolated using standard
molecular biological techniques. The construct was verified by DNA sequence analysis.

Isolation of S. erythraea Q42/1 (Biot-2166)

Plasmid pSGKC1 was used to transform S. erythraea SGQ2 using standard techniques
followed by selection with apramycin. Thiostrepton/apramycin resistant transformant S.
erythraea Q42/1 was isolated.

Bioconversion using S. erythraea Q42/1pSG144tylAItylAIItylMIIItylBtylIatylMIeryCIII

Bioconversion assays using 3-O-mycarosyl erythronolide B are carried out as described in
General Methods. Improved levels of mycaminosyl erythromycin A are detected in
bioconversion assays using S. erythraea
Q42/1pSG144tylAItylAIItylMIIItylBtylIatylMIeryCIII compared to bioconversion levels
previously observed (WO 01/79520).

Example 2: Isolation of mycaminosyl tylactone using gene cassette
pSG144tylAItylAIItylMIIItylBtylIatylMItylMII

Isolation of plasmid pSGLit1tylMItylMII

Plasmid pUCtylMI (isolation described in WO 01/79520) was PacI digested and the insert was
ligated with the PacI digested vector fragment of pSGLit1tylMII using standard procedures.
Plasmid pSGLit1tylMItylMII no16 was isolated and the construct was confirmed by
restriction digests and sequence analysis.

Isolation of plasmid pSG144tylAItylAIItylMIIItylBtylIatylMItylMII

Plasmid pSGLit1tylMItylMII no16 was digested with XbaI/BglII and the insert band was
isolated and ligated with the XbaI/BglII digested vector fragment of plasmid
pSG144tylAItylAIItylMIIItylBtylIa no3. Plasmid
pSG144tylAItylAIItylMIIItylBtylIatylMItylMII was isolated using standard procedures and the
construct was confirmed using restriction digests and sequence analysis. The plasmid was
isolated and used for transformation of S. erythraea mutant strains using standard protocols.

Bioconversion using gene cassette pSG144tylAItylAIItylMIIItylBtylIatylMItylMII

The conversion of fed tylactone to mycaminosyl tylactone was assessed in bioconversion
assays using S. erythraea Q42/1pSG144tylAItylAIItylMIIItylBtylIatylMItylMII.
Bioconversion assays were carried out using standard protocols (see Chemical Request sheet
81). The analysis of the culture showed the major ion to be 568.8 [M+H]+, consistent with the
presence of mycaminosyl tylactone. Fragmentation of this ion gave a daughter ion of m/z 174,
as expected for protonated mycaminose. No tylactone was detected during the analysis of the
culture extracts, indicating that the bioconversion of the fed tylactone was complete.
Recently, a homologue of TylIa was identified in the biosynthetic pathway of dTDP-3-
acetamido-3,6-dideoxy-alpha-D-galactose in Aneurinibacillus thermoaerophilus L420-91T
(Pfoestl et al., 2003) and the function was postulated as a novel type of isomerase capable of
synthesizing dTDP-6-deoxy-D-xylohex-3-ulose from dTDP-6-deoxy-D-xylohex-4-ulose.
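
The ions reported above for mycaminosyl tylactone can be cross-checked with a short mass calculation. The following Python sketch is illustrative and rests on assumptions that are not stated in this document: the molecular formulae of tylactone (C23H38O5) and mycaminose (C8H17NO4) are taken from general chemical knowledge, average atomic masses are used, and the m/z 174 daughter ion is assumed to be the sugar after loss of water at the glycosidic bond.

```python
# Average atomic masses (assumption: average rather than monoisotopic values).
AVERAGE_MASS = {"C": 12.011, "H": 1.008, "N": 14.007, "O": 15.999}
PROTON = 1.007


def mass(formula):
    """Average molecular mass for a formula given as {element: count}."""
    return sum(AVERAGE_MASS[element] * count for element, count in formula.items())


# Assumed molecular formulae (not given in the source text).
TYLACTONE = {"C": 23, "H": 38, "O": 5}
MYCAMINOSE = {"C": 8, "H": 17, "N": 1, "O": 4}
WATER = {"H": 2, "O": 1}

# Glycosylation joins aglycone and sugar with loss of one water.
glycoside = mass(TYLACTONE) + mass(MYCAMINOSE) - mass(WATER)
parent_ion = glycoside + PROTON
# Sugar fragment after cleavage of the glycosidic bond (sugar minus water, protonated).
sugar_fragment = mass(MYCAMINOSE) - mass(WATER) + PROTON

if __name__ == "__main__":
    print(f"[M+H]+ of mycaminosyl tylactone: {parent_ion:.1f}")
    print(f"sugar fragment ion: {sugar_fragment:.1f}")
```

Under these assumptions both calculated values agree with the reported ions to within the precision quoted in the text.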

Example 3: Bioconversion of 3-O-mycarosyl erythronolide B to 5-O-dedesosaminyl-5-O-
mycaminosyl erythromycins using gene cassette pSG1448/27/95/21/44/193/6eryCIII
(pSG144angAIangAIIorf14angMIIIangBangMIeryCIII).

Cloning of angMIII by isolating plasmid Lit1/4

The gene angMIII was amplified by PCR using the primers BIOSG61 5'-
GGGCATATGAGCCCCGCACCCGCCACCGAGGACCC-3' and BIOSG62 5'-
GGTCTAGAGGTCAGTTCCGCGGTGCGGTGGCGGGCAGGTCAC-3'. Cosmid5B2
containing a fragment of the angolamycin biosynthetic pathway was used as template. The 1.4
kb PCR fragment (PCR no1) was cloned into EcoRV digested plasmid Litmus28 using
standard procedures. Plasmid Lit1/4 was isolated with an NdeI site overlapping the start codon
of angMIII and an XbaI site following the stop codon. The construct was verified by sequence
analysis.

Isolation of plasmid pSGLit21/4

Plasmid Lit1/4 was digested with NdeI/XbaI and the approximately 1.4 kb fragment was
isolated and ligated to NdeI/XbaI digested DNA of pSGLit2. The ligation was used to
transform E. coli ET12567 and plasmid pSGLit21/4 no7 (dam⁻) was isolated. This construct
was digested with XbaI and used for the construction of gene cassettes.

Cloning of angMII by isolating plasmid Lit2/8

The gene angMII was amplified by PCR using the primers BIOSG63 5'-
GGGCATATGCGTATCCTGCTGACGTCGTTCGCGCACAACAC-3' and BIOSG64 5'-
GGTCTAGAGGTCAGGCGCGGCGGTGCGCGGCGGTGAGGCGTTCG-3';
cosmid5B2 containing a fragment of the angolamycin biosynthetic pathway was used as
template. The 1.3 kb PCR fragment (PCR no2) was cloned into EcoRV digested plasmid
Litmus28 using standard procedures. Plasmid Lit2/8 was isolated with an NdeI site overlapping
the start codon of angMII and an XbaI site following the stop codon. The construct was
verified by sequence analysis.

Cloning of angMII by isolating plasmid pLitangMII(BglII)

The gene angMII was amplified by PCR using primers BIOSG63 5'-
GGGCATATGCGTATCCTGCTGACGTCGTTCGCGCACAACAC-3' and BIOSG80 5'-
GGAGATCTGGCGCGGCGGTGCGCGGCGGTGAGGCGTTCG-3' and cosmid5B2
containing a fragment of the angolamycin biosynthetic pathway as template. The 1.3 kb PCR
fragment was cloned into EcoRV digested plasmid Litmus28 using standard procedures.
Plasmid LitangMII(BglII) no8 was isolated with an NdeI site overlapping the start codon of
angMII and a BglII site instead of a stop codon, thus allowing the addition of a his-tag. The
construct was verified by sequence analysis.

Isolation of plasmid pSGLit1angMII

Plasmid LitangMII(BglII) was digested with NdeI/BglII and ligated with the NdeI/BglII
digested vector fragment of pSGLit1. The ligation was used to transform E. coli ET12567 and
plasmid pSGLit1angMII (dam⁻) was isolated using standard procedures.

Cloning of angMI by isolating plasmid Lit3/6

The gene angMI was amplified by PCR using the primers BIOSG65 5'-
GGGCATATGAACCTCGAATACAGCGGCGACATCGCCCGGTTG-3' and BIOSG66 5'-
GGTCTAGAGGTCAGGCCTGGACGCCGACGAAGAGTCCGCGGTCG-3';
cosmid5B2 containing a fragment of the angolamycin biosynthetic pathway was used as
template. The 0.75 kb PCR fragment (PCR no3) was cloned into EcoRV digested plasmid
Litmus28 using standard procedures. Plasmid Lit3/6 was isolated with an NdeI site overlapping
the start codon of angMI and an XbaI site following the stop codon. The construct was
verified by sequence analysis.

Isolation of plasmid pSGLit23/6 no8

Plasmid Lit3/6 was digested with NdeI/XbaI and the approximately 0.8 kb fragment was
isolated and ligated to NdeI/XbaI digested DNA of pSGLit2. The ligation was used to
transform E. coli ET12567 and plasmid pSGLit23/6 no8 (dam⁻) was isolated. This construct
was digested with XbaI and the isolated approximately 1 kb fragment was used for the
assembly of gene cassettes.

Cloning of angB by isolating plasmid Lit4/19

The gene angB was amplified by PCR using the primers BIOSG67 5'-
GGGCATATGACTACCTACGTCTGGGACTACCTGGCGG-3' and BIOSG68 5'-
GGTCTAGAGGTCAGAGCGTGGCCAGTACCTCGTGCAGGGC-3'; cosmid4H2
containing a fragment of the angolamycin biosynthetic pathway was used as template. The 1.2
kb PCR fragment (PCR no4) was cloned into EcoRV digested plasmid Litmus28 using
standard procedures. Plasmid Lit4/19 was isolated with an NdeI site overlapping the start codon
of angB and an XbaI site following the stop codon. The construct was verified by sequence
analysis.

Isolation of plasmid pSGLit24/19

Plasmid Lit4/19 was digested with NdeI/XbaI and the 1.2 kb fragment was isolated and
ligated into NdeI/XbaI digested DNA of pSGLit2. The ligation was used to transform E. coli
ET12567 and plasmid pSGLit24/19 no24 (dam⁻) was isolated. This construct was digested
with XbaI and the isolated 1.2 kb fragment was used for the assembly of gene cassettes.

Cloning of orf14 by isolating plasmid Lit5/2

The gene orf14 was amplified by PCR using the primers BIOSG69 5'-
GGGCATATGGTGAACGATCCGATGCCGCGCGGCAGTGGCAG-3' and BIOSG70 5'-
GGTCTAGAGGTCAACCTCCAGAGTGTTTCGATGGGGTGGTGGG-3', and cosmid4H2
containing a fragment of the angolamycin biosynthetic pathway was used as template. The 1.0
kb PCR fragment (PCR no5) was cloned into EcoRV digested plasmid Litmus28 using standard
procedures. Plasmid Lit5/2 was isolated with an NdeI site overlapping the start codon
of orf14 and an XbaI site following the stop codon. The construct was verified by sequence
analysis.

Isolation of plasmid pSGLit25/2 no24

Plasmid Lit5/2 was digested with NdeI/XbaI and the approximately 1 kb fragment was
isolated and ligated to NdeI/XbaI digested DNA of pSGLit2. The ligation was used to
transform E. coli ET12567 and plasmid pSGLit25/2 no24 (dam) was isolated. This construct
was digested with XbaI, and the about 1 kb fragment was isolated and used for the assembly
of gene cassettes.

Isolation of plasmid pSGLit27/9 no15

Plasmid Lit7/9 was digested with NdeI/XbaI and the approximately 1 kb fragment was
isolated and ligated to NdeI/XbaI digested DNA of pSGLit2. The ligation was used to
transform E. coli ET12567 and plasmid pSGLit27/9 no15 (dam) was isolated. This construct
was digested with XbaI and the isolated 1 kb fragment was used for the assembly of gene
cassettes.

Cloning of angAI (orf2) by isolating plasmid Lit8/2
The gene angAI was amplified by PCR using the primers BIOSG73 5'-
GGGCATATGAAGGGCATCATCCTGGCGGGCGGCAGCGGC-3' and BIOSG74 5'-
GGTCTAGAGGTCATGCGGCCGGTCCGGACATGAGGGTCTCCGCCAC-3', and
cosmid4H2 containing a fragment of the angolamycin biosynthetic pathway was used as
template. The around 1.0 kb PCR fragment (PCR no8) was cloned into EcoRV digested
plasmid Litmus28 using standard procedures. Plasmid Lit8/2 was isolated with an NdeI site
overlapping the start codon of angAI and an XbaI site following the stop codon. The construct
was verified by sequence analysis.

Cloning of angAII (orf3) by isolating plasmid Lit7/9
The gene angAII was amplified by PCR using the primers BIOSG71 5'-
GGGCATATGCGGCTGCTGGTCACCGGAGGTGCGGGC-3' and BIOSG72 5'-
GGTCTAGAGGTCAGTCGGTGCGCCGGGCCTCCTGCG-3', and cosmid4H2 containing a
fragment of the angolamycin biosynthetic pathway was used as template. The 1.0 kb PCR
fragment was cloned into EcoRV digested plasmid Litmus28 using standard procedures.
Plasmid Lit7/9 was isolated with an NdeI site overlapping the start codon of angAII and an
XbaI site following the stop codon. The construct was verified by sequence analysis.

Isolation of plasmid pSGLit28/2 no18 (pSGLit2angAI)

Plasmid Lit8/2 was digested with NdeI/XbaI and the 1 kb fragment was isolated and ligated to
NdeI/XbaI digested DNA of pSGLit2. The ligation was used to transform E. coli ET12567
and plasmid pSGLit28/2 no18 (dam) was isolated.

Isolation of plasmid pSG1448/2 (pSG144angAI)

Plasmid Lit8/2 was digested with NdeI/XbaI and the approximately 1 kb fragment was
isolated and ligated with NdeI/XbaI digested DNA of pSG144. The ligation was used to
transform E. coli DH10B and plasmid pSG1448/2 (dam) (pSG144angAI) was isolated using
standard procedures. This construct was verified with restriction digests and sequence
analysis.

Isolation of plasmid pSG1448/27/9 (pSG144angAIangAII)

Plasmid pSGLit27/9 (isolated from E. coli ET12567) was digested with XbaI and the 1 kb
fragment was isolated and ligated with the XbaI digested vector fragment of pSG1448/2
(pSG144angAI). The ligation was used to transform E. coli DH10B and plasmid
pSG1448/27/9 (pSG144angAIangAII) was isolated using standard protocols. The construct
was verified with restriction digests and sequence analysis.

Isolation of plasmid pSG1448/27/91/4 (pSG144angAIangAIIangMIII)

Plasmid pSGLit21/4 (isolated from E. coli ET12567) was digested with XbaI and the 1.4 kb
fragment was isolated and ligated with the XbaI digested vector fragment of pSG1448/27/9
(pSG144angAIangAII). The ligation was used to transform E. coli DH10B and plasmid
pSG1448/27/91/4 (pSG144angAIangAIIangMIII) was isolated using standard protocols. The
construct was verified with restriction digests and sequence analysis.

Isolation of plasmid pSG1448/27/91/44/19 (pSG144angAIangAIIangMIIIangB)
Plasmid pSGLit24/19 (isolated from E. coli ET12567) was digested with XbaI and the about
1.2 kb fragment was isolated and ligated with the XbaI digested vector fragment of
pSG1448/27/91/4 (pSG144angAIangAIIangMIII). The ligation was used to transform E. coli
DH10B and plasmid pSG1448/27/91/44/19 (pSG144angAIangAIIangMIIIangB) was isolated
using standard protocols. The construct was verified with restriction digests and sequence
analysis.

Isolation of plasmid pSG1448/27/91/44/193/6 (pSG144angAIangAIIangMIIIangBangMI)
Plasmid pSGLit23/6 (isolated from E. coli ET12567) was digested with XbaI and the about
0.8 kb fragment was isolated and ligated with the XbaI digested vector fragment of
pSG1448/27/91/44/19 (pSG144angAIangAIIangMIIIangB). The ligation was used to
transform E. coli DH10B and plasmid pSG1448/27/91/44/193/6
(pSG144angAIangAIIangMIIIangBangMI) was isolated using standard protocols. The
construct was verified with restriction digests and sequence analysis.

Isolation of plasmid pSG1448/27/91/44/193/6eryCIII
(pSG144angAIangAIIangMIIIangBangMIeryCIII)
Plasmid pSGLit1eryCIII (isolated from E. coli ET12567) was digested with XbaI/BglII and
the about 1.2 kb fragment was isolated and ligated with the XbaI digested and partially BglII
digested vector fragment of pSG1448/27/91/44/193/6
(pSG144angAIangAIIangMIIIangBangMI). The BglII partial digest was necessary due to the
presence of a BglII site in angB. The ligation was used to transform E. coli DH10B and
plasmid pSG1448/27/91/44/193/6eryCIII no9
(pSG144angAIangAIIangMIIIangBangMIeryCIII) was isolated using standard protocols. The
construct was verified with restriction digests and sequence analysis. EryCIII carries a his-tag
fusion at the end.

Bioconversion of 3-O-mycarosyl erythronolide B to 5-O-dedesosaminyl-5-O-mycaminosyl
erythromycin A using S. erythraea Q42/1pSG1448/27/91/44/193/6eryCIII no9
(pSG144angAIangAIIangMIIIangBangMIeryCIII)

The S. erythraea strain Q42/1pSG1448/27/91/44/193/6eryCIII was grown and bioconversions
with fed 3-O-mycarosyl erythronolide B were performed as described in the General
Methods. The cultures were analysed and a small amount of a compound with m/z 750 was
detected, consistent with the presence of 5-O-dedesosaminyl-5-O-mycaminosyl erythromycin
A.

Isolation of plasmid pSG1448/27/95/2 (pSG144angAIangAIIorf14)

Plasmid pSGLit25/2 (isolated from E. coli ET12567) was digested with XbaI and the about 1
kb fragment was isolated and ligated with the XbaI digested vector fragment of pSG1448/27/9
(pSG144angAIangAII). The ligation was used to transform E. coli DH10B and plasmid
pSG1448/27/95/2 (pSG144angAIangAIIorf14) was isolated using standard protocols. The
construct was verified with restriction digests and sequence analysis.

Isolation of plasmid pSG1448/27/95/21/4 (pSG144angAIangAIIorf14angMIII)
Plasmid pSGLit21/4 (isolated from E. coli ET12567) was digested with XbaI and the 1.4 kb
fragment was isolated and ligated with the XbaI digested vector fragment of
pSG1448/27/95/2 (pSG144angAIangAIIorf14). The ligation was used to transform E. coli
DH10B and plasmid pSG1448/27/95/21/4 (pSG144angAIangAIIorf14angMIII) was isolated
using standard protocols. The construct was verified with restriction digests and sequence
analysis.

Isolation of plasmid pSG1448/27/95/21/44/19 (pSG144angAIangAIIorf14angMIIIangB)

Plasmid pSGLit24/19 (isolated from E. coli ET12567) was digested with XbaI and the 1.2 kb
fragment was isolated and ligated with the XbaI digested vector fragment of
pSG1448/27/95/21/4 (pSG144angAIangAIIorf14angMIII). The ligation was used to transform
E. coli DH10B and plasmid pSG1448/27/95/21/44/19
(pSG144angAIangAIIorf14angMIIIangB) was isolated using standard protocols. The
construct was verified with restriction digests and sequence analysis.

Isolation of plasmid pSG1448/27/95/21/44/193/6eryCIII
(pSG144angAIangAIIorf14angMIIIangBangMIeryCIII)

Plasmid pSG1448/27/91/44/193/6eryCIII no9 was digested with BglII and the about 2 kb
fragment was isolated and ligated with the BglII digested vector fragment of
pSG1448/27/95/21/44/19 (pSG144angAIangAIIorf14angMIIIangB). The ligation was used to
transform E. coli DH10B and plasmid pSG1448/27/95/21/44/193/6eryCIII
(pSG144angAIangAIIorf14angMIIIangBangMIeryCIII) was isolated using standard
protocols. The construct was verified with restriction digests and sequence analysis.
EryCIII carries a his-tag fusion at the end. The construct was used to transform S. erythraea
SGQ2 using standard procedures.

Bioconversion of 3-O-mycarosyl erythronolide B to 5-O-dedesosaminyl-5-O-mycaminosyl
erythromycin A

The S. erythraea strain SGQ2pSG1448/27/95/21/44/193/6eryCIII was grown and
bioconversions with fed 3-O-mycarosyl erythronolide B were performed as described in the
General Methods. The cultures were analysed and improved amounts of a compound with m/z
750 were detected, consistent with the presence of 5-O-dedesosaminyl-5-O-mycaminosyl
erythromycin A. Similar results were obtained with the S. erythraea strain Q42/1 containing
the gene cassette pSG1448/27/95/21/44/193/6eryCIII.

16 mg of the compound with m/z 750 was purified and the structure of
5-O-dedesosaminyl-5-O-mycaminosyl erythromycin A was confirmed by NMR analysis (See
Table I and Figure 1).

Table II: 1H and 13C NMR data for 5-O-dedesosaminyl-5-O-mycaminosyl erythromycin A
Position    δH    Multiplicity    Coupling

a This carbon was assigned from the HMQC spectrum



Example 4: Isolation of mycaminosyl tylactone 

Isolation of plasmid pSG1448/27/95/21/44/193/6tylMII
(pSG144angAIangAIIorf14angMIIIangBangMItylMII)

Plasmid pSG1448/27/91/44/193/6tylMII no9 was digested with BglII and the about 2 kb
fragment was isolated and ligated with the BglII digested vector fragment of
pSG1448/27/95/21/44/19 (pSG144angAIangAIIorf14angMIIIangB). The ligation was used to
transform E. coli DH10B and plasmid pSG1448/27/95/21/44/193/6tylMII
(pSG144angAIangAIIorf14angMIIIangBangMItylMII) was isolated using standard protocols.
The construct was verified with restriction digests and sequence analysis.
TylMII carries a his-tag fusion at the end.

Bioconversion of tylactone to mycaminosyl tylactone
The S. erythraea strain Q42/1pSG1448/27/95/21/44/193/6tylMII is grown and bioconversions
with fed tylactone are performed as described in the General Methods. The cultures are
analysed and a compound with m/z 568 is detected, consistent with the presence of
mycaminosyl tylactone.
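
The expected m/z of the glycosylated product can be estimated from the elemental compositions of aglycone and sugar. The following is a minimal sketch under stated assumptions: the formulae for tylactone (C23H38O5) and mycaminose (C8H17NO4) are not given in this document, and detection as the singly protonated ion [M+H]+ is assumed. The function names are illustrative.

```python
# Sketch: expected m/z of a glycoside formed by condensing an aglycone with a sugar.
# The formulae below are assumptions (not stated in the text); [M+H]+ is assumed.
from collections import Counter

MONOISOTOPIC = {"C": 12.0, "H": 1.00782503, "N": 14.0030740, "O": 15.9949146}
PROTON = 1.00727646

TYLACTONE = Counter({"C": 23, "H": 38, "O": 5})           # assumed formula
MYCAMINOSE = Counter({"C": 8, "H": 17, "N": 1, "O": 4})   # assumed formula
WATER = Counter({"H": 2, "O": 1})


def glycoside(aglycone, sugar):
    """Elemental composition after glycosidic bond formation (loss of one water)."""
    product = aglycone + sugar
    product.subtract(WATER)
    return product


def mz_protonated(composition):
    """Monoisotopic m/z of the singly protonated molecule."""
    neutral = sum(MONOISOTOPIC[el] * n for el, n in composition.items())
    return neutral + PROTON


product = glycoside(TYLACTONE, MYCAMINOSE)
print(dict(product), round(mz_protonated(product), 2))
```

Under these assumptions the nominal value agrees with the m/z 568 quoted for mycaminosyl tylactone.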

Example 5: Isolation of 5-O-dedesosaminyl-5-O-angolosaminyl erythromycins using
gene cassette pSG1448/27/91/4spnO5/2p4/193/6tylMII by bioconversion of 3-O-mycarosyl
erythronolide B.

Isolation of plasmid conv no1

For the multiple use of promoter sequences in act-controlled gene cassettes a 240 bp fragment
was amplified by PCR using the primers BIOSG78 5'-
GGGCATATGTGTCCTCCTTAATTAATCGATGCGTTCGTCC-3' and BIOSG79 5'-
GGAGATCTGGTCTAGATCGTGTTCCCCTCCCTGCCTCGTGGTCCCTCACGC-3' and
plasmid pSG142 (Gaisser et al., 2000) as template. The 0.2 kb PCR fragment (PCR no5) was
cloned into EcoRV digested plasmid Litmus28 using standard procedures. Plasmid conv no1
was isolated. The construct was verified by sequence analysis.

Isolation of pSGLit3relig1

Plasmid conv no1 was digested with NdeI/BglII and the about 0.2 kb fragment was isolated
and ligated with the BamHI/NdeI digested vector fragment of pSGLit2. The ligation was used
to transform E. coli DH10B and plasmid pSGLit3relig1 was isolated using standard
procedures. This construct was verified using restriction digests and sequence analysis.

Isolation of plasmid pSGLit34/19

Plasmid Lit4/19 was digested with NdeI/XbaI and the 1.2 kb fragment was isolated and
ligated to NdeI/XbaI digested DNA of pSGLit3. The ligation was used to transform E. coli
ET12567 and plasmid pSGLit34/19 no23 was isolated. This construct was digested with XbaI
and the isolated 1.4 kb fragment was used for the assembly of gene cassettes.

Cloning of orf4 by isolating plasmid Lit6/4

The gene orf4 was amplified by PCR using the primers BIOSG75 5'-
GGGCATATGAGCACCCCTTCCGCACCACCCGTTCCG-3' and BIOSG76 5'-
GGTCTAGAGGTCAGTACAGCGTGTGGGCACACGCCACCAG-3', and cosmid4H2
containing a fragment of the angolamycin biosynthetic pathway was used as template. The 2.5
kb PCR fragment (PCR no6) was cloned into EcoRV digested plasmid Litmus28 using standard
procedures. Plasmid Lit6/4 was isolated with an NdeI site overlapping the start codon
of orf4 and an XbaI site following the stop codon. The construct was verified by sequence
analysis.

Isolation of plasmid pSGLit26/4 no9

Plasmid Lit6/4 was digested with NdeI/XbaI and the DNA was isolated and ligated to
NdeI/XbaI digested DNA of pSGLit2. The ligation was used to transform E. coli ET12567
and plasmid pSGLit26/4 no9 was isolated. This construct was confirmed by restriction digests
and sequence analysis.

Cloning of spnO by isolating plasmid pUC19spnO

The gene spnO from the spinosyn biosynthetic gene cluster of Saccharopolyspora spinosa
was amplified by PCR using the primers BIOSG41 5'-
GGGCATATGAGCAGTTCTGTCGAAGCTGAGGCAAGTG-3' and BIOSG42 5'-
GGTCTAGAGGTCATCGCCCCAACGCCCACAAGCTATGCAGG-3' and genomic DNA
of S. spinosa as template. The about 1.5 kb PCR fragment was cloned into SmaI digested
plasmid pUC19 using standard procedures. Plasmid pUC19spnO no2 was isolated with an
NdeI site overlapping the start codon of spnO and an XbaI site following the stop codon. The
construct was verified by sequence analysis.

Isolation of plasmid pSGLit2spnO no4

Plasmid pUC19spnO was digested with NdeI/XbaI and the 1.5 kb fragment was isolated and
ligated to NdeI/XbaI digested DNA of pSGLit2. The ligation was used to transform E. coli
ET12567 and plasmid pSGLit2spnO no4 was isolated using standard procedures. This
construct was digested with XbaI and the isolated 1.5 kb fragment was used for the assembly
of gene cassettes.

Isolation of plasmid pSG1448/27/91/4spnO (pSG144angAIangAIIangMIIIspnO)
Plasmid pSGLit2spnO no4 (isolated from E. coli ET12567) was digested with XbaI and the
1.5 kb fragment was isolated and ligated with the XbaI digested vector fragment of
pSG1448/27/91/4 (pSG144angAIangAIIangMIII). The ligation was used to transform E. coli
DH10B and plasmid pSG1448/27/91/4spnO (pSG144angAIangAIIangMIIIspnO) was isolated
using standard protocols. The construct was verified with restriction digests and sequence
analysis.

Isolation of plasmid pSG1448/27/91/4spnO5/2 (pSG144angAIangAIIangMIIIspnOangorf14)
Plasmid pSGLit25/2 no24 (isolated from E. coli ET12567) was digested with XbaI and the 1
kb fragment was isolated and ligated with the XbaI digested vector fragment of
pSG1448/27/91/4spnO (pSG144angAIangAIIangMIIIspnO). The ligation was used to
transform E. coli DH10B and plasmid pSG1448/27/91/4spnO5/2
(pSG144angAIangAIIangMIIIspnOangorf14) was isolated using standard protocols. The
construct was verified with restriction digests and sequence analysis.

Isolation of plasmid pSG1448/27/91/4spnO5/2p4/19
(pSG144angAIangAIIangMIIIspnOangorf14pangB)

Plasmid pSGLit34/19 no23 (isolated from E. coli ET12567) was digested with XbaI and the
about 1.4 kb fragment was isolated and ligated with the XbaI digested vector fragment of
pSG1448/27/91/4spnO5/2 (pSG144angAIangAIIangMIIIspnOangorf14). The ligation was
used to transform E. coli DH10B and plasmid pSG1448/27/91/4spnO5/2p4/19
(pSG144angAIangAIIangMIIIspnOangorf14pangB) was isolated using standard protocols.
The construct was verified with restriction digests and sequence analysis. 'p' indicates the
presence of the promoter region in front of angB to emphasize the presence of multiple
promoter sites in the construct.
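
The plasmid names used in these examples follow a regular pattern: the vector name is followed by the clone codes (or gene names) of the inserted fragments in cassette order, with 'p' marking a fragment that carries its own promoter. The following sketch reproduces that pattern for the construct just described; the function and variable names are hypothetical and serve only to illustrate the convention.

```python
# Sketch of the naming convention: each cassette element has a short clone code
# and a gene name; a leading 'p' marks an element preceded by its own promoter.
BACKBONE = "pSG144"

# (clone code, gene name, preceded by promoter?) in cassette order,
# taken from the construct described in the text above.
ELEMENTS = [
    ("8/2", "angAI", False),
    ("7/9", "angAII", False),
    ("1/4", "angMIII", False),
    ("spnO", "spnO", False),
    ("5/2", "angorf14", False),
    ("4/19", "angB", True),
]


def cassette_names(backbone, elements):
    """Return (short name, descriptive name) for an ordered list of elements."""
    short = backbone
    descriptive = backbone
    for code, gene, has_promoter in elements:
        prefix = "p" if has_promoter else ""
        short += prefix + code
        descriptive += prefix + gene
    return short, descriptive


short, descriptive = cassette_names(BACKBONE, ELEMENTS)
print(short)
print(descriptive)
```

Reading a name from left to right therefore gives the order in which the genes sit in the cassette.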

Isolation of plasmid pSG1448/27/91/4spnO5/2p4/193/6eryCIII
(pSG144angAIangAIIangMIIIspnOorf14pangBangMIeryCIII)

Plasmid pSG1448/27/91/44/193/6eryCIII no9 was digested with BglII and the about 2 kb
fragment was isolated and ligated with the BglII digested vector fragment of
pSG1448/27/91/4spnO5/2p4/19 (pSG144angAIangAIIangMIIIspnOorf14pangB). The
ligation was used to transform E. coli DH10B and plasmid
pSG1448/27/91/4spnO5/2p4/193/6eryCIII
(pSG144angAIangAIIangMIIIspnOorf14pangBangMIeryCIII) was isolated using standard
protocols. The construct was verified with restriction digests and sequence analysis. EryCIII
carries a his-tag fusion at the end. 'p' indicates the presence of the promoter region in front of
angB to emphasize the presence of multiple promoter sites in the construct. The plasmid
construct was used to transform mutant strains of S. erythraea using standard procedures.

Bioconversion of 3-O-mycarosyl erythronolide B to 5-O-dedesosaminyl-5-O-angolosaminyl
erythromycins

Strain S. erythraea Q42/1pSG1448/27/91/4spnO5/2p4/193/6eryCIII was grown and
bioconversions with fed 3-O-mycarosyl erythronolide B were performed as described in the
General Methods. The cultures were analysed and peaks with m/z 704, m/z 718 and m/z 734,
consistent with the presence of angolosaminyl erythromycin D, B and A, respectively, were
observed.
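
The spacing of the three peaks is consistent with the known tailoring steps of the erythromycin pathway. The sketch below interprets the nominal mass differences, assuming that a shift of +14 corresponds to O-methylation (addition of CH2) and +16 to hydroxylation (addition of O); this assignment is an interpretation and is not stated in the text. The names used are illustrative.

```python
# Sketch: interpret the nominal mass shifts between the three observed peaks.
# Assumption: +14 corresponds to O-methylation (CH2) and +16 to hydroxylation (O).
SHIFTS = {14: "O-methylation (+CH2)", 16: "hydroxylation (+O)"}
PEAKS = {"D": 704, "B": 718, "A": 734}   # m/z values reported in the text


def explain(lighter, heavier):
    """Describe the modification implied by the difference between two peaks."""
    delta = PEAKS[heavier] - PEAKS[lighter]
    return delta, SHIFTS.get(delta, "unassigned")


d_to_b = explain("D", "B")
b_to_a = explain("B", "A")
print(d_to_b, b_to_a)
```

On this reading the D to B step reflects methylation of the mycarosyl group and the B to A step reflects hydroxylation at C-12, matching the EryG and EryK activities described in the Background.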

Example 6: Production of 5-O-angolosaminyl tylactone

Isolation of plasmid pSG1448/27/91/4spnO5/2p4/193/6tylMII
(pSG144angAIangAIIangMIIIspnOorf14pangBangMItylMII)
Plasmid pSG1448/27/91/44/193/6tylMII no9 was digested with BglII and the about 2 kb
fragment was isolated and ligated with the BglII digested vector fragment of
pSG1448/27/91/4spnO5/2p4/19 (pSG144angAIangAIIangMIIIspnOorf14pangB). The
ligation was used to transform E. coli DH10B and plasmid
pSG1448/27/91/4spnO5/2p4/193/6tylMII
(pSG144angAIangAIIangMIIIspnOorf14pangBangMItylMII) was isolated using standard
protocols. The construct was verified with restriction digests and sequence analysis. TylMII
carries a his-tag fusion at the end. The plasmid was used to transform mutant strains of S.
erythraea applying standard protocols. 'p' indicates the presence of the promoter region in
front of angB to emphasize the presence of multiple promoter sites in the construct.

Isolation of S. erythraea 18A1 (BIOT-2634)

To introduce a deletion comprising the PKS and the majority of post-PKS genes in S. erythraea, a
region of the left hand side of the ery-cluster (LHS) containing a portion of eryCI, the
complete ermE gene and a fragment of the eryBI gene was cloned together with a region of
the right hand side of the ery-cluster (RHS) containing a portion of the eryBVII gene, the
complete eryK gene and a fragment of DNA adjacent to eryK. This construct should enable
homologous recombination into the genome in both LHS and RHS regions, resulting in the
isolation of a strain containing a deletion between these two regions of DNA. The LHS
fragment (2201 bp) was PCR amplified using S. erythraea chromosomal DNA as template
and primers BIdelNde (5'-CCCATATGACCGGAGTTCGAGGTACGCGGCTTG-3') and
BIdelSpe (5'-GATACTAGTCCGCCGACCGCACGTCGCTGAGCC-3'). Primer BIdelNde
contains an NdeI restriction site (CATATG) and primer BIdelSpe contains a SpeI restriction
site used for subsequent cloning steps. The PCR product was cloned into the SmaI restriction
site of pUC19, and plasmid pLSB177 was isolated using standard procedures. The construct
was confirmed by sequence analysis. Similarly, RHS (2158 bp) was amplified by PCR using
S. erythraea chromosomal DNA as template and primers BVIIdelSpe (5'-
TGCACTAGTGGCCGGGCGCTCGACGTCATCGTCGACAT-3') and BVIIdelEco (5'-
TCGATATCGTGTCCTGCGGTTTCACCTGCAACGCTG-3'). Primer BVIIdelSpe contains
a SpeI restriction site and primer BVIIdelEco contains an EcoRV restriction site. The PCR
product was cloned into the SmaI restriction site of pUC19 in the orientation with SpeI
positioned adjacent to KpnI and EcoRV positioned adjacent to XbaI. The plasmid pLSB178
was isolated and confirmed using sequence analysis. Plasmid pLSB177 was digested with
NdeI and SpeI, the ~2.2 kb fragment was isolated and similarly plasmid pLSB178 was
digested with NdeI and SpeI and the ~4.6 kb fragment was isolated using standard methods.
Both fragments were ligated and plasmid pLSB188 containing LHS and RHS combined
together at a SpeI site in pUC19 was isolated using standard protocols. An NdeI/XbaI
fragment (~4.4 kbp) from pLSB188 was isolated and ligated with SpeI and NdeI treated
pCJR24. The ligation was used to transform E. coli DH10B and plasmid pLSB189 was
isolated using standard methods. Plasmid pLSB189 was used to transform S. erythraea P2338
and transformants were selected using thiostrepton. S. erythraea Del 18 was isolated and
inoculated into 6 ml TSB medium and grown for 2 days. A 5% inoculum was used to
subculture this strain 3 times. 100 µl of the final culture were used to plate onto R2T20 agar
followed by an incubation at 30°C to allow sporulation. Spores were harvested, filtered,
diluted and plated onto R2T20 agar using standard procedures. Colonies were replica plated
onto R2T20 plates with and without addition of thiostrepton. Colonies that could no longer
grow on thiostrepton were selected and further grown in TSB medium. S. erythraea 18A1
was isolated and confirmed using PCR and Southern blot analysis. The strain was designated
LB-1/BIOT-2634. For further analysis, the production of erythromycin was assessed as
described in General Methods and the lack of erythromycin production was confirmed. In
bioconversion assays this strain did not further process fed erythronolide B and erythromycin
D was hydroxylated at C12 to give erythromycin C as expected, indicating that EryK was still
functional.
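
The design of the deletion construct can be sanity-checked from the figures given above. The following sketch, with illustrative variable names, looks up the stated restriction site in each of the four primers and sums the two homology arms for comparison with the NdeI/XbaI fragment of about 4.4 kbp. The recognition sequences used (CATATG, ACTAGT, GATATC) are the standard ones for NdeI, SpeI and EcoRV and are supplied here, not taken from the text.

```python
# Sketch: check the restriction sites in the four primers and the combined arm length.
SITES = {"NdeI": "CATATG", "SpeI": "ACTAGT", "EcoRV": "GATATC"}

# primer name -> (sequence 5'->3' as printed in the text, enzyme site it should carry)
PRIMERS = {
    "BIdelNde": ("CCCATATGACCGGAGTTCGAGGTACGCGGCTTG", "NdeI"),
    "BIdelSpe": ("GATACTAGTCCGCCGACCGCACGTCGCTGAGCC", "SpeI"),
    "BVIIdelSpe": ("TGCACTAGTGGCCGGGCGCTCGACGTCATCGTCGACAT", "SpeI"),
    "BVIIdelEco": ("TCGATATCGTGTCCTGCGGTTTCACCTGCAACGCTG", "EcoRV"),
}
ARM_LENGTHS_BP = {"LHS": 2201, "RHS": 2158}

# Position of each expected site within its primer (-1 would mean "not found").
site_positions = {name: seq.find(SITES[enzyme])
                  for name, (seq, enzyme) in PRIMERS.items()}
combined_kb = sum(ARM_LENGTHS_BP.values()) / 1000
print(site_positions, combined_kb)
```

The combined arm length of 4.359 kb agrees with the approximate size quoted for the fragment excised from pLSB188.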

Bioconversion of tylactone to 5-O-angolosaminyl tylactone
Strain S. erythraea SGQ2pSG1448/27/91/4spnO5/2p4/193/6tylMII was grown and
bioconversions with fed tylactone were performed as described in the General Methods. The
cultures were extracted and analysed. A compound consistent with the presence of
angolosaminyl tylactone was detected. 20 mg of this compound were purified and the
structure was confirmed by NMR analysis. A compound consistent with the presence of
angolosaminyl tylactone was also obtained when the gene cassette
pSG1448/27/91/4spnO5/2p4/193/6tylMII was expressed in the S. erythraea strain Q42/1 or S.
erythraea 18A1.

Table III: NMR data for 5-O-β-D-angolosaminyl tylactone
#    δC    δH (mult., Hz)    COSY H-H    HMBC H-C

Isolation of plasmid pSG1448/27/91/4spnOp5/2
(pSG144angAIangAIIangMIIIspnOpangorf14)

Plasmid pSGLit35/2 (isolated from E. coli ET12567) was digested with XbaI and the insert
fragment was isolated and ligated with the XbaI digested vector fragment of
pSG1448/27/91/4spnO (pSG144angAIangAIIangMIIIspnO). The ligation was used to
transform E. coli DH10B and plasmid pSG1448/27/91/4spnOp5/2
(pSG144angAIangAIIangMIIIspnOpangorf14) was isolated using standard protocols. The
construct was verified with restriction digests and sequence analysis.

Isolation of plasmid pSG1448/27/91/4spnOp5/24/19
(pSG144angAIangAIIangMIIIspnOpangorf14angB)

Plasmid pSGLit24/19 (isolated from E. coli ET12567) was digested with XbaI and the insert
fragment was isolated and ligated with the XbaI digested vector fragment of
pSG1448/27/91/4spnOp5/2 (pSG144angAIangAIIangMIIIspnOpangorf14). The ligation was
used to transform E. coli DH10B and plasmid pSG1448/27/91/4spnOp5/24/19
(pSG144angAIangAIIangMIIIspnOpangorf14angB) was isolated using standard protocols.
The construct was verified with restriction digests and sequence analysis.

Isolation of plasmid pSG1448/27/91/4spnOp5/24/193/6
(pSG144angAIangAIIangMIIIspnOpangorf14angBangMI)
Plasmid pSGLit23/6 (isolated from E. coli ET12567) was digested with XbaI and the insert
fragment was isolated and ligated with the XbaI digested vector fragment of
pSG1448/27/91/4spnOp5/24/19 (pSG144angAIangAIIangMIIIspnOpangorf14angB). The
ligation was used to transform E. coli DH10B and plasmid pSG1448/27/91/4spnOp5/24/193/6
(pSG144angAIangAIIangMIIIspnOpangorf14angBangMI) was isolated using standard
protocols. The construct was verified with restriction digests and sequence analysis.

Isolation of plasmid pSG1448/27/91/4spnOp5/24/193/6angMII
(pSG144angAIangAIIangMIIIspnOpangorf14angBangMIangMII)

Plasmid pSGLit1angMII (isolated from E. coli ET12567) was digested with XbaI/BglII and
the insert fragment was isolated and ligated with the XbaI and partial BglII digested vector
fragment of pSG1448/27/91/4spnOp5/24/193/6
(pSG144angAIangAIIangMIIIspnOpangorf14angBangMI). The ligation was used to
transform E. coli DH10B and plasmid pSG1448/27/91/4spnOp5/24/193/6angMII
(pSG144angAIangAIIangMIIIspnOpangorf14angBangMIangMII) was isolated using
standard protocols. The construct was verified with restriction digests and sequence analysis.
The plasmid was used to transform mutant strains of S. erythraea with standard procedures.

Biotransformation using S. erythraea Q42/1pSG1448/27/91/4spnOp5/24/193/6angMII
(pSG144angAIangAIIangMIIIspnOpangorf14angBangMIangMII)

Biotransformation experiments feeding tylactone are carried out as described in General
Methods and the cultures are analysed. Angolosaminyl tylactone is detected.

Isolation of plasmid pSG1448/27/96/4 (pSG144angAIangAIIangorf4)
Plasmid pSG1448/27/9 (pSG144angAIangAII) was digested with XbaI and treated with
alkaline phosphatase using standard protocols. The vector fragment was used for ligations
with XbaI treated plasmid pSGLit26/4 no9 followed by transformations of E. coli DH10B
using standard protocols. Plasmid pSG1448/27/96/4 (pSG144angAIangAIIangorf4) was
isolated using standard procedures and the construct was confirmed by restriction digests and
sequence analysis.

Isolation of plasmid pSG1448/27/96/4p5/2 (pSG144angAIangAIIangorf4pangorf14)
Plasmid pSGLit35/2 (isolated from E. coli ET12567) was digested with XbaI and the insert
fragment was isolated and ligated with the XbaI digested vector fragment of
pSG1448/27/96/4 (pSG144angAIangAIIangorf4). The ligation was used to transform E. coli
DH10B and plasmid pSG1448/27/96/4p5/2 (pSG144angAIangAIIangorf4pangorf14) was
isolated using standard protocols. The construct was verified with restriction digests and
sequence analysis.

Isolation of plasmid pSG1448/27/96/4p5/21/4
(pSG144angAIangAIIangorf4pangorf14angMIII)

Plasmid pSGLit21/4 (isolated from E. coli ET12567) was digested with XbaI and the 1.4 kb
fragment was isolated and ligated with the XbaI digested vector fragment of
pSG1448/27/96/4p5/2 (pSG144angAIangAIIangorf4pangorf14). The ligation was used to
transform E. coli DH10B and plasmid pSG1448/27/96/4p5/21/4
(pSG144angAIangAIIangorf4pangorf14angMIII) was isolated using standard protocols. The
construct was verified with restriction digests and sequence analysis.

Isolation of plasmid pSG1448/27/96/4p5/21/44/19
(pSG144angAIangAIIangorf4pangorf14angMIIIangB)

Plasmid pSGLit24/19 (isolated from E. coli ET12567) was digested with XbaI and the 1.4 kb
fragment was isolated and ligated with the XbaI digested vector fragment of
pSG1448/27/96/4p5/21/4 (pSG144angAIangAIIangorf4pangorf14angMIII). The ligation was
used to transform E. coli DH10B and plasmid pSG1448/27/96/4p5/21/44/19
(pSG144angAIangAIIangorf4pangorf14angMIIIangB) was isolated using standard protocols.
The construct was verified with restriction digests and sequence analysis.



Isolation of plasmid pSG1448/27/96/4p5/21/44/193/6angMII
(pSG144angAIangAIIangorf4pangorf14angMIIIangBangMIangMII)

Plasmid pSG1448/27/91/4spnOp5/24/193/6angMII was digested with BglII and the about 2.2
kb fragment was isolated and ligated with the BglII treated vector fragment of
pSG1448/27/96/4p5/21/44/19. The ligation was used to transform E. coli DH10B using
standard procedures and plasmid pSG1448/27/96/4p5/21/44/193/6angMII
(pSG144angAIangAIIangorf4pangorf14angMIIIangBangMIangMII) was isolated. The
construct was verified using restriction digests and sequence analysis. The plasmid was used
to transform mutant strains of S. erythraea with standard protocols.

Bioconversion of tylactone with S. erythraea Q42/1pSG1448/27/96/4p5/21/44/193/6angMII
(pSG144angAIangAIIangorf4pangorf14angMIIIangBangMIangMII)
Biotransformation experiments feeding tylactone are carried out as described in General
Methods and the cultures are analysed. Angolosaminyl tylactone is detected.

Example 7: Cloning of eryK into the gene cassette pSG144

Isolation of plasmid pUC19eryK
To amplify eryK, primers eryK1 5'-
GGTCTAGACTACGCCGACTGCCTCGGCGAGGAGCCC-3' and eryK2 5'-
GGCATATGTTCGCCGACGTGGAAACGACCTGCTGCG-3' were used and the PCR
product was cloned as described for pUC19eryCVI. Plasmid pUC19eryK was isolated.

Isolation of plasmid pLSB111 (pCJR24eryK)

Plasmid pUC19eryK was digested with NdeI/XbaI and the insert band was ligated with
NdeI/XbaI digested pCJR24. Plasmid pLSB111 (pCJR24eryK) was isolated and the construct
was verified with restriction digests.



Isolation of plasmid pLSB115

Plasmid pLSB111 (pCJR24eryK) was digested with NdeI/XbaI and the insert fragment was
isolated and ligated with the NdeI/XbaI digested vector fragment of plasmid pSGLit2 and
plasmid pLSB115 was isolated using standard protocols. The plasmid was verified using
restriction digestion and DNA sequence analysis.

Isolation of plasmid pSG1448/27/95/21/4eryK

Plasmid pLSB115 from E. coli ET12567 was digested with XbaI and the insert fragment was
isolated and ligated with the XbaI treated vector fragment of pSG1448/27/95/21/4
(pSG144angAIangAIIangorf14angMIII). The ligation was used to transform E. coli DH10B
with standard procedures and plasmid pSG1448/27/95/21/4eryK
(pSG144angAIangAIIangorf14angMIIIeryK) is isolated. The construct is confirmed with
restriction digests.

Isolation of plasmid pSG1448/27/95/21/4eryK4/19

Plasmid pSGLit24/19 from E. coli ET12567 is digested with XbaI and the insert fragment is
isolated and ligated with the XbaI treated vector fragment of plasmid
pSG1448/27/95/21/4eryK. The ligation is used to transform E. coli DH10B with standard
procedures and plasmid pSG1448/27/95/21/4eryK4/19
(pSG144angAIangAIIangorf14angMIIIeryKangB) is isolated. The construct is confirmed
with restriction digests.

Isolation of plasmid pSG1448/27/95/21/4eryK4/193/6eryCIII

Plasmid pSG1448/27/95/21/44/193/6eryCIII is digested with BglII and the approximately
2.1 kb fragment is isolated and ligated with the BglII treated vector fragment of
pSG1448/27/95/21/4eryK4/19. Plasmid pSG1448/27/95/21/4eryK4/193/6eryCIII is isolated
using standard procedures and the construct is confirmed using restriction digests. The
plasmid is used to transform mutant strains of S. erythraea with standard methods.

Bioconversion of 3-O-mycarosyl erythronolide B to 5-O-dedesosaminyl-5-O-mycaminosyl
erythromycin A

The S. erythraea strain Q42/1pSG1448/27/95/21/4eryK4/193/6eryCIII is grown and
bioconversions with fed 3-O-mycarosyl erythronolide B are performed as described in the
General Methods. The cultures are analysed and a compound with m/z 750 is detected,
consistent with the presence of 5-O-dedesosaminyl-5-O-mycaminosyl erythromycin A.
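
The m/z value quoted above can be rationalised with a short calculation. The sketch below rests on assumptions that are not stated in the text: that the observed ion is the protonated molecule [M+H]+, that erythromycin A has the molecular formula C37H67NO13, and that replacing desosamine by mycaminose adds one oxygen atom (the additional sugar hydroxyl group). The function names are hypothetical.

```python
# Monoisotopic masses (u) of the elements involved.
MONO = {"C": 12.0, "H": 1.00782503, "N": 14.0030740, "O": 15.9949146}
PROTON = 1.00727646  # mass of a proton (u)

# Molecular formula of erythromycin A (assumed: C37H67NO13).
ERYTHROMYCIN_A = {"C": 37, "H": 67, "N": 1, "O": 13}


def mono_mass(formula):
    """Monoisotopic mass of a formula given as {element: count}."""
    return sum(MONO[element] * count for element, count in formula.items())


def mh_plus(formula):
    """m/z of the singly protonated molecule [M+H]+."""
    return mono_mass(formula) + PROTON


def desosamine_to_mycaminose(formula):
    """Mycaminose carries one hydroxyl group more than desosamine: +1 O."""
    product = dict(formula)
    product["O"] += 1
    return product


if __name__ == "__main__":
    product = desosamine_to_mycaminose(ERYTHROMYCIN_A)
    print(f"erythromycin A [M+H]+: {mh_plus(ERYTHROMYCIN_A):.4f}")
    print(f"5-O-mycaminosyl analogue [M+H]+: {mh_plus(product):.4f}")
```

Under these assumptions the mycaminosyl analogue gives a nominal [M+H]+ of 750, in agreement with the compound detected.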

Example 8: Production of 13-desethyl-13-methyl-5-O-mycaminosyl erythromycins A
and B; 13-desethyl-13-isopropyl-5-O-mycaminosyl erythromycin A and B;
13-desethyl-13-secbutyl-5-O-mycaminosyl erythromycin A and B

Production of 13-desethyl-13-methyl-3-O-mycarosyl erythronolide B,
13-desethyl-13-isopropyl-3-O-mycarosyl erythronolide B and
13-desethyl-13-secbutyl-3-O-mycarosyl erythronolide B

Plasmid pLS025 (WO 03/033699), a pCJR24-based plasmid containing the DEBS1, DEBS2
and DEBS3 genes, in which the loading module of DEBS1 has been replaced by the loading
module of the avermectin biosynthetic cluster, was used to transform S. erythraea
JC2ΔeryCIII (isolated using techniques and plasmids described previously (Rowe et al., 1998;
Gaisser et al., 2000)) using standard techniques. The transformant JC2ΔeryCIIIpLS025 was
isolated and cultures were grown using standard protocols. Cultures of S. erythraea
JC2ΔeryCIIIpLS025 are extracted using methods described in the General Methods section
and the presence of 3-O-mycarosyl erythronolide B, 13-desethyl-13-methyl-3-O-mycarosyl
erythronolide B, 13-desethyl-13-isopropyl-3-O-mycarosyl erythronolide B and
13-desethyl-13-secbutyl-3-O-mycarosyl erythronolide B in the crude extract is verified by
LCMS analysis.

Production of 13-desethyl-13-methyl-5-O-dedesosaminyl-5-O-mycaminosyl erythromycin A
and B, 13-desethyl-13-isopropyl-5-O-dedesosaminyl-5-O-mycaminosyl erythromycin A and B,
13-desethyl-13-secbutyl-5-O-dedesosaminyl-5-O-mycaminosyl erythromycin A and B
Cultures of S. erythraea JC2ΔeryCIIIpLS025 are extracted using methods described in the
General Methods section and the crude extracts are dissolved in 5 ml of methanol and
subsequently fed to culture supernatants of the S. erythraea strain
SGQ2pSG1448/27/95/21/44/193/6eryCIII using standard techniques. The bioconversion of
13-desethyl-13-methyl-3-O-mycarosyl erythronolide B, 13-desethyl-13-isopropyl-3-O-mycarosyl
erythronolide B and 13-desethyl-13-secbutyl-3-O-mycarosyl erythronolide B to
13-desethyl-13-methyl-5-O-dedesosaminyl-5-O-mycaminosyl erythromycin A and
13-desethyl-13-methyl-5-O-dedesosaminyl-5-O-mycaminosyl erythromycin B;
13-desethyl-13-isopropyl-5-O-dedesosaminyl-5-O-mycaminosyl erythromycin A and
13-desethyl-13-isopropyl-5-O-dedesosaminyl-5-O-mycaminosyl erythromycin B;
13-desethyl-13-secbutyl-5-O-dedesosaminyl-5-O-mycaminosyl erythromycin A and
13-desethyl-13-secbutyl-5-O-dedesosaminyl-5-O-mycaminosyl erythromycin B is verified by
LCMS analysis.
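
For the LCMS verification of the six products named above, expected ion masses can be estimated as sketched below. The text does not report these values; the calculation assumes [M+H]+ ions, the formulae C37H67NO14 and C37H67NO13 for the 5-O-mycaminosyl analogues of erythromycin A and B respectively, and that exchanging the 13-ethyl group for methyl, isopropyl or sec-butyl changes the formula by -1, +1 or +2 CH2 units. All names in the code are hypothetical.

```python
# Monoisotopic masses (u) and the proton mass.
MONO = {"C": 12.0, "H": 1.00782503, "N": 14.0030740, "O": 15.9949146}
PROTON = 1.00727646

# Assumed formulae of the parent 5-O-mycaminosyl compounds (13-ethyl).
PARENT = {
    "A": {"C": 37, "H": 67, "N": 1, "O": 14},
    "B": {"C": 37, "H": 67, "N": 1, "O": 13},
}

# Change in the number of CH2 units relative to the natural 13-ethyl group.
CH2_SHIFT = {"methyl": -1, "isopropyl": 1, "secbutyl": 2}


def expected_mh_plus(series, substituent):
    """Expected [M+H]+ for a 13-substituted 5-O-mycaminosyl erythromycin."""
    formula = dict(PARENT[series])
    shift = CH2_SHIFT[substituent]
    formula["C"] += shift
    formula["H"] += 2 * shift
    mass = sum(MONO[element] * count for element, count in formula.items())
    return mass + PROTON


if __name__ == "__main__":
    for substituent in CH2_SHIFT:
        for series in ("A", "B"):
            mz = expected_mh_plus(series, substituent)
            print(f"13-{substituent}, erythromycin {series}: {mz:.4f}")
```

These figures are estimates for orientation only and would need to be confirmed against the ionisation conditions actually used.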

Example 9: 13-desethyl-13-methyl-5-O-dedesosaminyl-5-O-mycaminosyl erythromycin
A and 13-desethyl-13-methyl-5-O-dedesosaminyl-5-O-mycaminosyl erythromycin B

Production of 13-desethyl-13-methyl-3-O-mycarosyl erythronolide B

Plasmid pIB023 (Patent application no 0125043.0), a pCJR24-based plasmid containing the
DEBS1, DEBS2 and DEBS3 genes, was used to transform S. erythraea JC2ΔeryCIII using
standard techniques. The transformant JC2ΔeryCIIIpIB023 was isolated and cultures were
grown using standard protocols, extracted and the crude extract was assayed using methods
described in the General Methods section. The production of 3-O-mycarosyl erythronolide B
and 13-desethyl-13-methyl-3-O-mycarosyl erythronolide B is verified by LCMS analysis.

Production of 13-desethyl-13-methyl-5-O-dedesosaminyl-5-O-mycaminosyl erythromycin A,
13-desethyl-13-methyl-5-O-dedesosaminyl-5-O-mycaminosyl erythromycin B
Cultures of S. erythraea JC2ΔeryCIIIpIB023 are extracted using methods described in the
General Methods section and the crude extracts are dissolved in 5 ml of methanol and
subsequently fed to culture supernatants of S. erythraea
SGQ2pSG1448/27/95/21/44/193/6eryCIII using standard techniques. The bioconversion of
13-desethyl-13-methyl-3-O-mycarosyl erythronolide B to
13-desethyl-13-methyl-5-O-dedesosaminyl-5-O-mycaminosyl erythromycin A and
13-desethyl-13-methyl-5-O-dedesosaminyl-5-O-mycaminosyl erythromycin B is verified by
LCMS analysis.

Example 10: Production of 5-O-dedesosaminyl-5-O-mycaminosyl azithromycin

Azithromycin aglycones were prepared using methods described in EP 1024145 A2 (Pfizer
Products Inc., Groton, Connecticut). The S. erythraea strain SGT2pSG142 was isolated using
techniques and plasmid constructs described earlier (Gaisser et al., 2000). Feeding
experiments are carried out using methods described previously (Gaisser et al., 2000) with the
S. erythraea mutant SGT2pSG142, thus converting azithromycin aglycone to 3-O-mycarosyl
azithronolide. Biotransformation experiments are carried out using S. erythraea
SGQ2pSG1448/27/95/21/44/193/6eryCIII and crude extracts containing 3-O-mycarosyl
azithronolide are added using standard microbiological techniques. The bioconversion of
3-O-mycarosyl azithronolide to 5-O-dedesosaminyl-5-O-mycaminosyl azithromycin is verified
by LCMS analysis.

Example 11: Production of 5-O-dedesosaminyl-5-O-mycaminosyl erythromycin C

Isolation of the S. erythraea mutant SGP1 (SGQ2ΔeryG)

To create a chromosomal deletion in eryG, construct pSGAG3 was isolated as follows:
Fragment 1 was amplified using primers BIOSG53
5'-GGAATTCGGCCAGGACGCGTGGCTGGTCACCGGCT-3' and
BIOSG54 5'-GGTCTAGAAAGAGCGTGAGCAGGCTCTTCTACAGCCAGGTCA-3' and
genomic DNA of S. erythraea was used as template. Fragment 2 was amplified using primers
BIOSG55 5'-GGCATGCAGGAAGGAGAGAACCACGATGACCACCGACG-3' and
BIOSG56 5'-GGTCTAGACACCAGCCGTATCCTTTCTCGGTTCCTCTTGTG-3' and
genomic DNA of S. erythraea was used as template. Both DNA fragments were cloned into
SmaI cut pUC19 using standard techniques, plasmids pUCPCR1 and pUCPCR2 were isolated
and the sequence of the amplified fragments was verified. Plasmid pUCPCR1 was digested
using EcoRI/XbaI and the insert band DNA was isolated and cloned into EcoRI/XbaI digested
pUC19. Plasmid pSGAG1 is isolated using standard methods and digested with SphI/XbaI
followed by a ligation with the SphI/XbaI digested insert fragment of pUCPCR2. Plasmid
pSGAG2 is isolated using standard procedures, digested with SphI/HindIII and ligated with
the SphI/HindIII fragment of pCJR24 (Rowe et al., 1998) containing the gene encoding
thiostrepton resistance. Plasmid pSGAG3 is isolated and used to delete eryG in the genome of
S. erythraea strain SGQ2 using methods described previously (Gaisser et al., 1997; Gaisser et
al., 1998) and the S. erythraea mutant SGP1 (SGQ2ΔeryG) is created.
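
The cloning plan for the eryG deletion construct can be cross-checked against the primer sequences. The sketch below is illustrative only and assumes the standard recognition sequences of EcoRI (GAATTC), XbaI (TCTAGA) and SphI (GCATGC); the data structures and function names are hypothetical.

```python
# Standard recognition sequences (assumed).
SITES = {"EcoRI": "GAATTC", "XbaI": "TCTAGA", "SphI": "GCATGC"}

# Primers as given in the text, written 5'->3'.
PRIMERS = {
    "BIOSG53": "GGAATTCGGCCAGGACGCGTGGCTGGTCACCGGCT",
    "BIOSG54": "GGTCTAGAAAGAGCGTGAGCAGGCTCTTCTACAGCCAGGTCA",
    "BIOSG55": "GGCATGCAGGAAGGAGAGAACCACGATGACCACCGACG",
    "BIOSG56": "GGTCTAGACACCAGCCGTATCCTTTCTCGGTTCCTCTTGTG",
}

# Enzyme pair used in the text to excise each cloned PCR fragment,
# paired with the primer expected to introduce that site.
PLAN = {
    "fragment 1 (pUCPCR1)": [("BIOSG53", "EcoRI"), ("BIOSG54", "XbaI")],
    "fragment 2 (pUCPCR2)": [("BIOSG55", "SphI"), ("BIOSG56", "XbaI")],
}


def primer_has_site(primer, enzyme):
    """True if the named primer contains the enzyme's recognition sequence."""
    return SITES[enzyme] in PRIMERS[primer]


def check_plan(plan=PLAN):
    """Map each fragment to True if both of its primers carry the planned site."""
    return {
        fragment: all(primer_has_site(primer, enzyme) for primer, enzyme in pairs)
        for fragment, pairs in plan.items()
    }


if __name__ == "__main__":
    for fragment, ok in check_plan().items():
        print(fragment, "sites present" if ok else "site missing")
```

Both fragments share an XbaI end, which is where they are joined in pSGAG2 to give the in-frame deletion of eryG.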

Production of 5-O-dedesosaminyl-5-O-mycaminosyl erythromycin C
The S. erythraea strain SGP1 (S. erythraea SGQ2ΔeryG) is isolated using standard
techniques and is then transformed with the cassette construct
pSG1448/27/95/21/44/193/6eryCIII as described previously. The S. erythraea strain
SGP1pSG1448/27/95/21/44/193/6eryCIII is isolated and used for biotransformation as
described in Example 2 and assays are carried out as described above to verify the conversion
of 3-O-mycarosyl-erythronolide B to 5-O-dedesosaminyl-5-O-mycaminosyl erythromycin C
by LCMS analysis.

1 MNDRPRRAMKGIILAGGSGTO^ 50

51 LAGIREIQIISSKDHLDLFRSLLGEGDRLGLSISYAEQREPRGIAEAFLI 100
51 LAGIREIQIISSKDHLDLFRSLLGEGDRLGLSISYAEQREPRGIAEAFLI 100

101 GARHIGGDDAALILGDNVFHGPGFSSVLTGTVARLDGCELFGYPVKDAHR 150
101 GARHIGGDDAALILGDNVFHGPGFSSVLTGTVARLDGCELFGYPVKDAHR 150

151 YGVGEIDSGGRLLSLEEKPRRPR^NLAVTGLYLYTNDVVEIARTISPSAR 200
151 YGVGEIDSGGRLLSLEEKPRRPLEP.GRHRLYL.YTNDWEIARTISPSAR 199

201 GELEITDWKVYLEQGR ARLTELGRGFA WLDMGTHDSLLQAGQYVQLLEQ 250
200 GELEITDVNKVYLEQGRA . AHGAGAWAWLDMGTHDSLLQAGQYVQLLEQ 248

251 RQGERIACIEEIAMRMGFISAEQCYRLGQELRSSSYGSYIIDVAMRGAAA 300
249 RQGERIACIEEIAMRMGFISAEQCYRLGQELRSSSYGSYIIDVAMRGAAA 298

301 DSRAQ 305
299 DSRAQ 303

Figure 4

TylAII.pep x u08223.em_pro2

1 MRVLVTGGAGFIGSHFTGQLLTGAYPDLGATRTVVLDKLTYAGNPANLEH 50
1 MRVLVTGGAGFIGSHFTGQLLTGAYPDLGATRTVVLDKLTYAGNPANLEH 50

51 VAGHPDLEFVRGDIADQAXAfRRLMEGVGLVVHFAAESHVDRSIESSEAFV 100
51 VAGHPDLEFVRGDIADHGWWRRLMEGVGLVVHFAAESHVDRSIESSEAFV 100

101 RTNVEGTRVLLQAAVDAGVGRFVHISTDEVYGSIAEGSWPEDHPLAPNSP 150
101 RTNVEGTRVLLQAAVDAGVGRFVHISTDEVYGSIAEGSWPEDHPVAPNSP 150

151 YAATKAASDLLAI^YHRTYGLDVRVTRCSlSrNYGPRQYPEKAVPLFTTNLL 200
151 YAATKAASDLLALAYHRTYGLDVRVTRCSNNYGPRQYPEKAVPLFTTNLL 200

201 DGLPVPLYGDGGNTREWLHVDDHCRGVALVAAGGRPGVIYNIGGGTELTN 250
201 DGLPVPLYGDGGNTREWLHVDDHCRGVALVGAGGRPGVIYNIGGGTELTN 250

251 AELTDRILELCGADRSAVRRVADRPGHDRRYSVDTTKIREELGYAPRTGI 300
251 AELTDRILELCGADRSALRRVADRPGHDRRYSVDTTKIREELGYAPRTGI 300

301 TEGLAGTVAWYRDNRAWWEPLKRSPGGRELERA 333
301 TEGLAGTVAWYRDNRAWWEPLKRSPGGRELERA 333

Figure 7

1 GGCATGCCTT CGGGGTGTGC GGCGGCGCCT CAGAGCGTGG CCAGTACCTC 
51 GTGCAGGGCC GCGATCACCT TGTCCTGTAC GTCGGGCGCG AGCCCCGGGT 
101 ACATCGGCAG CGAGAAGATC TCGTCCGCCA GCCGCTCCGT CACCGGCAGC 
151 GAGCCCTTGG CGTACCCCAG GTGCGCGAAG CCCGTCATGG TGTGCACGGG 
201 CCACGGGTAA CTGATGTTGA GCGAGATCCC GTACGACTTG AGCGCCTCGA 
251 TGATGTCGTC CCGGCGCGGG TGGCGGACGA CGTACACGTA ATACACGTGG 
301 TCGTTGCCCT CGGTGACGGA CGGCAGCACC AGGCCGCCGG GGCCCGTCAG 
351 GTTCGCGAGT CCTTCGGCGT AACGCCGGGC GACCGCGCGC CGGCCCTCGA 
401 TGTAGCGGTC GAGGCGGGTG AGCTTGCGGC GCAGGATCTC CGCCTGCACC 
451 TCGTCGAGCC GGCTGTTGTG GCCGGGCGTC TGCACGACGT AGTACACGTC 
501 CTCCATGCCG TAGTAGCGCA GCCGGCGCAG CGCACGGTCG ACGTCCGCGT 
551 CGTCGGTCAG CACGGCCCCG CCGTCGCCGT ACGCACCGAG GACCTTCGTC
601 GGGTAGAACG AGAAGGCGGC GGCGTCGCCC AGCGTGCCGG CCAGCTCGCC 
651 GTGGTGGCGG GCACCGTGCG CCTGGGCGCA GTCCTCCAGC ACCACCAGGC 
701 CGTGCTGCTC GGCCAGGGCG CGCAAGGGCG CCATGTCGAC GCACTGCCCG
751 TACAGGTGCA CCGGCAGCAG GGCCTTCGTG CGCGGGGTGA TGACGTCCGC 
801 GACCTGGTCG GTGTCCATGA GGTGGTCCTC GGCGCGGACG TCGACGAAGA 
851 CGGGCGTGGC ACCGGTGCCG TCGATGGCCA CCACCGTCGG CGCGGCCGTG 
901 TTGGAGACGG TGACGACCTC GTCCCCCGGG CCCACCCCGA GCGCCTGCAG 
951 ACCCAGCTTG ACGGCGTTGG TGCCGTTGTC GACACCGCCG CAGTGGCGCA 
1001 GGCCGTGGTA GTCCGCGAAC TCCTTCTCGA ACCCGTCCAC GCTGGGGCCG 
1051 AGGACCAACT GCCCGGAGGC GAAGACGGTC TCGACGGCGT CGAGGAGGTC 
1101 CGCGCGTTCG TTCTGGTATT CCGCCAGGTA GTCCCAGACG TAGGTAGTCA 
1151 CGGAGAGCTC AACCTCCAGA GTGTTTCGAT GGGGTGGTGG GAAGCCGGTG 
1201 CGCGCGGACC AGGTCGTGCC AGCAGTCGCG GACCGACTCC CGCAGCGAAC 
1251 GGCGCGGTGC CCAGCCCAGC AGGGCGCGCG CCGCGCCGGT GTCGACCCGC 
1301 AGCCAGTCCT CCCGGTGCCC GGGAGCCCGG CCCGGAGCCG GGCGCTCCAC
1351 CACCCGCGCC GGAATGCCGC TCGCCTCGAT GAACAGGCCG ACCAGGTCGC
1401 GGACGGCGAC CGCCTCGCCC CGCCCGATGC CGACGGCGAC CGGGACGGCC
1451 GGTGCGCGGG CGGCGGCCAC GACGGCGTCG GCCACGTCCC GCACATCGAC
1501 GTAGTCCCGG TGCGCGCGCA GCCGGGACAG TTCCACGACG GCCTCCGCAC
1551 CCGTCCCGGC GGCGGCCAGC AGCCGCTCGG CGACCTGGCC CAGCAGACTG
1601 ATCCGCGGGG TGCCGGGGCC CGACACGTTG GACACCCGTA GCACCACACC
1651 GTCGACCCAC CCGCCCGAGG TGCCCCGCAG CACCGCCTCG CTGGCGGCGA
1701 GCTTGCTCCT GCCGTACGCC GTGTCCGGGC GCGGTACGGC GTCGGCGCCC
1751 ACCGAACCGC CGGGCGTCAC CGGGCCGTAC TCCAGTACCG AGCCGAGGTG
1801 GACCAGCCGC GGCCGCGCGG ACATCAGCGC CAGCGCCTCC AGCAGGCGCA
1851 GCGTGGGCAC CGCGGTGGCG GACCACATCT GCTCGTCGGT ACGGCCCCAG
1901 ATGCTTCCGA CGGAGTTGAC GATCGTGTCC GGACGCTCCG CGTCCAGGGC
1951 GGCGGCCAGC GCCGCGGGAT CCGTACCGGC CAGGTCCAGG GTGACGCAGC
2001 GGTACGGCAT CGGCTCCTCG GGCGGGCGGC GGCCCACCAC CACCACGTCA
2051 CGGCCCCGCG CGGCGAACGC CGCGCACACA TGCCGGCCGA CGTACCCGGC
2101 GCCGCCCAGG ACCACGACGC TGCCACTGCC ACTGCCGCGC GGCATCGGAT
2151 CGTTCACCAT

ATPPAPTPPA PPAPPAPACG CGCCCGTCGC GGCCCCATGA ACAGGTCCAG
12251 ATAPPPPATP TPGPGCCCGC GGTGCACCCC GGTGAAGTTG CTCCGGGTGG
12301 CCTGCACGGT CGGCGACACC TGAAGAACGT TGACGTTCCC GGGCTCCATC
12351 TTGGCCTGCA TCAGGAAGTG CAGCACCCCG TCGATCTCCC GCGCCACGAT
12401 CCCGAGCAGC CCCACCTCCG GCTGCACGAT GATGGGCTGC GTCCAGCCCC
12451 GCTCGGGCAG CCGGTCCGTA CGGACGTGCA GCCCCTCCAC GGAGAAGAAA
12501 CGGCCCGACG CGTGGTGCAG GTTTCCCGTA CCCGGGTGGA AGCTCCAGCC
12551 GCGCAGCTCC GCGAAGGGAA CGCGGGACAC GTCGAAGCGC CCCGCCCGCA
12601 GGCGTTCGGC CAGCCAGCCG GAGATGCCGT CGAACGGCGT GACCGCACTG
12651 TCCGCGGTGC GTGCCGACAC CAGCACCCGC CGCGCCGTGT CCACCGGGTC
12701 ACCGGGCCGG ACCGCGTCCG CACGGCGCCG CGCGGCGCCG TGCGGGGCGG 
12751 GGGCGGATCG CGGCGGTACG GGTTCGCGGG CGGTGTCCGC GGCGGTGCGC
12801 GGCGGGACGG GGCCGGTGCT CGTGTCCGCG GCGGTACGCG GTGGGACGGT 
12851 CCCGGTGGCC GTGTCCGCGG TGGCCGTGCC GGCGAGGGCG TCGCCGATGG 
12901 TCCGGCACAC CTCGTCCATC CGGTCGTTCA GATAGAAGTG ACCGCCGGCG 
12951 AAGGTGTGCA GGGCGAAGGG GCCCGTGGTC AGCTCCCGCC AGGCCCTCGC 
13001 CTCCTCCAGC GGGACATCGG GATCACGGTC ACCGGTGAGC ACCGTGACCG 
13051 GACAGTCCAG CGCACCGCCG GGCACATACG CGTACGTGCC CGCCGCCCGG
13101 TAGTCGTTGC GGATCGCCGG CAGGGCCAGC CGCAGCAGCT CCTCGTCCTG 
13151 GAGGACGGCG TCCTCGGTGC CCTGAAGCGT GGCGATCTCC GCGATCAGCG 
13201 CGTCGTCGTC GAGGAGGTGG GCGACGTCCC GCCGGCGCAC CGTCGGCGCA 
13251 CGGCGGCCCG ACACCAGCAG ATGGACGGGG GAGGCCTGCC CGGAACCGCG
13301 CAGCCGGCGC GCGACCTCGA ACGCCACCGT GGCACCCATG CTGTGCCCGA 
13351 ACAGCGCGAG CGGACGGTCG GCCCAGCGCA GGATCTCCGG CACCACCTGG 
13401 TCCACCAGGC CCGATATGGA CGGGATGAAC GGCTCGTGCC GGCGGTCCTG
13451 GCGGCCCGGG TACTGCACCG CCAGCGCCTC CACGGTCTCG TCCAGTCCGC 
13501 GTGCCAGGGC GGCGAAGGAG GTCGCGGCGC CACCGGCGTG CGGGAAGCAG 
13551 ACCAGACGCA GTTCCGGATC CCGCACCGGG CGGTAACGGC GGACCCACAG 
13601 ACCCTCGTCC GGGTGTCCGG CCGGCGACGG GGCTCCCGGA ACGGGTGGTG 
13651 CGGAAGGGGT GCTCACGGCG GATCCAGCTC CTCGCGGTCG GGGGGACCGC 
13701 TGTCGGGGAC GGCACGTCGG GTGCGGACGT CGGGTACGGG CGTCGGGGCG 
13751 TGACGGGGAG GGACGGGGCG GTCGGTCAGT CGGTGCGCCG GGCCTCCTGC 
13801 GCGGCCTTCT TCAGCGGTTC CCACCACGCG CGGTTCTCCG CGTACCAGCG
13851 CACCGTGTCC GCCAGGCCCG TCGTGAAGTC CGTACGCGGG GCATAGCCCA 
13901 GCTCGCCCGT GATCTTGCCG ATGTCCAGCG CGTACCGCAG GTCGTGCCCC 
13951 GGCCGGTCGG CGACGTGGCG CACCGACGAG GCGTCGGCAC CGCACAGCCC 
14001 GAGCAGCCGC TTCGTCAGCT CCCGGTTGGT CAGCTCCGTC CCGCCACCGA 
14051 TGTGGTAGAC CTCGCCCGGG CGCCCGCGGG TCGCCACCAG GCTGATCCCG
14101 CGGCAGTGGT CGTCCACGTG CAGCCAGTCC CGGCTGTTGC CGCCGTCGCT
14151 GTACAGCGGC ACCGTCAGAC CGTCCAACAG GTTCGTGGCG AAGAGCGGGA 
14201 CGACCTTCTC GGGGTGCTGG TACGGGCCGT AGTTGTTGGA GCACCGGGTG 
14251 ACGAGGAGCG GCAGGCCGTA CGTCCGGTGG TAGGCCAGCG CCAGGAGGTC 
14301 CGACGCCGGC TTCGAGGCGG CGTACGGGGA GTTCGGCGCC AGCGGCTGCT
14351 CCTCGCGCCA CGACCCCTCG GCGATCGAGC CGTACACCTC GTCCGTGGAG 
14401 ACGTGGACGA ACCGGCCGGC CCCCGCCTCC ACCGCGGCCT GCAAGAGGAC
14451 TTGCGTCCGC CGTACGTTCG TCTCGACGAA CGCCGACGCG TCGGCGATGG
14501 AGCGGTCCAC GTGCGACTCC GCCGCGAAGT GGACCACGAC GTCCGCCCCC 
14551 CGCACGACCC GGGACATCAC CTCCGCGTCC CGGATGTCGG CGTGCACGAA 
14601 CTCCAGCGAC GGATGGTCCG CGACCGGGTC CAGGTTGGCG AGGTTCCCGG 
14651 CATAGGTCAG CTTGTCGACC ACCACCGTCC GCGCCCCGGC CAGGTCCGGA
14701 TACGCCCCGG CCAGCAGTTG TCTGACGAAG TGCGAGCCGA TGAAGCCCGC 
14751 ACCTCCGGTG ACCAGCAGCC GCATGGGAGC ACAGACCTTT CTTCCAGGGA 
14801 CGGGAAACGG GGAGGCGGAC GGGGACGGAG GCGAGGGCGG TGGCTATGCG 
14851 GCCGGTCCGG ACATGAGGGT GTCCGCCACG TCCATCAAGT ACCGGCCGTA 
14901 GCTGGAGCTC TCGAGTTCAC GGCCGAGCTC GTGGCACTGC CGCGCGCTGA
14951 TGTACCCCAT CCGCAGGGCG ATCTCCTCGA CGCAGGAGAT CCGCACGCCC
15001 TGCCGCTGCT CCAGGAGCTG GACGTACTGC CCCGCTTGCA GCAGCGAGCT 
15051 GTGCGTGCCC ATGTCCAGCC AGGCGAACCC GCGCCCCAGT TCCGTCATAC 
15101 GGGCGCGGCC CTGCTCCAGG TACACCTTGT TGACGTCGGT GATCTCCAGC 
15151 TCGCCCCGCG GCGACGGTGT CAGCCGCCGG GCGATGTCCA CCACGCCGTT 
15201 GTCGTAGAAG TACAGCCGCG TCACCGCGAG ATGGGAGCGG GGCTTCTCCG 
15251 GCTTCTCCTC CAGGGACACC AGCCGGCCTT CCGCGTCGAC CTCGCCGACG 
15301 CCGTAGCGCC GGGGGTCCTT CACCGGGTAG CCGAACAGCT CGCAGCCGTC 
15351 CAGCCGCGCC GCGGTGGAGG CCAGCACGGA GGAGAACCCC GGACCGTGGA 
15401 AGACGTTGTC CCCCAGGATG AGGGCGACCG GGTCGTCCCC GATGTGCTCC 
15451 TCGCCGATGA GGAACGCCTC GGCGATGCCC CGGGGCTCCT CCTGCTCGGC
15501 GTAGCCGACA CTGATCCCGA TGCGGCTGCC GTCGCCCAGC AGCGAACGGA
15551 ACATCTCCAA GTGCGTCTTC GACGTGATGA TCTGGATGTC CCGGATCCCC
15601 GCCAGCATGA GCACCGACAG CGGGTAGTAG ATCATGGGCT TGTCGTAGAC
15651 CGGCAGCAAC TGCTTGGACA GTGCCCCGGT CAGGGGGCGC AGGCGCGTGC
15701 CGCTGCCGCC CGCCAGGATG ATGCCCTTCA TGGGCCGCCG GTCCGCCGTC
15751 GTCTTCGTCA T

Figure 9

59800 G 

59801 TGAGCCCCGC ACCCGCCACC GAGGACCCGG CCGCCGCCGG GCGCCGCCTG 
59851 CAACTGACCC GCGCAGCCGA GTGGTTCGCG GGAACCCAGG ACGACCCGTA 
59901 CGCGCTCGTC CTGCGCGCCG AGGCCACCGA CCCGGCCCCG TACGAGGAGC 
59951 GGATCCGGGC CCACGGGCCG CTCTTCCGCA GCGACCTGCT CGACACCTGG 
60001 GTCACGGCGA GCAGGGCCGT CGCCGACGAA GTGATCACCT CACCCGCCTT 
60051 CGACGGGCTC ACGGCCGACG GGCGGCGCCC CGGCGCGGGG GAACTGCCGC 
60101 TGTCCGGCAC CGCGCTCGAC GCGGACCGCG CCACATGCGC ACGGTTCGGG 
60151 GCCCTCACCG CCTGGGGCGG GCCGCTGCTG CCGGCGCCGC ACGAGCGGGC 
60201 GCTGCGCGAG TCCGCCGAAC GGCGGGCCCA CACACTCCTC GACGGGGCGG 
60251 AGGCCGCCCT GGCCGCCGAC GGCACCGTCG ACCTCGTCGA CGCGTACGCC 
60301 CGCAGGCTCC CCGCGCTGGT CCTCCGCGAA CAGCTCGGCG TGCCGGAGGA 
60351 GGCGGCGACC GCCTTCGAGG ACGCGCTGGC CGGCTGCCGC CGCACCCTGG 
60401 ACGGCGCCCT GTGCCCGCAA CTCCTCCCGG ACGCCGTGGC GGGGGTGCGC
60451 GCGGAAGCCG CGCTGACCGC CGTGCTGGCC TCCGCCCTGC GCGGGACTCC 
60501 GGCCGGCCGG GCCCCCGACG CCGTCGCCGC CGCCCGCACC CTGGCCGTCG 
60551 CGGCCGCCGA GCCCGCAGCC ACCCTCGTCG GCAACGCCGT ACAGGAGCTG 
60601 CTGGCGCGTC CCGCGCAGTG GGCGGAGCTC GTACGCGACC CGCGCCTCGC
60651 GGCCGCCGCG GTGACCGAAA CGCTGCGTGT CGCGCCGCCC GTCCGCCTGG 
60701 AGCGGCGGGT CGCCCGCGAG GACACGGACA TCGCCGGGCA GCGCCTCCCC
60751 GCCGGGGGGA GCGTCGTGAT CCTCGTCGCC GCCGTCAACC GCGCGCCCGT
60801 ATCCGCGGGA AGCGACGCCT CCACCACCGT CCCGCACGCC GGCGGCCGGC
60851 CCCGTACCTC CGCCCCCTCC GTCCCCTCAG CCCCCTTCGA CCTCACACGG 
60901 CCCGTGGCCG CGCCCGGGCC GTTCGGGCTC CCCGGCGACC TGCACTTCCG
60951 CCTCGGCGGG CCCCTGGTCG GAACGGTCGC CGAAGCCGCG CTCGGTGCGC
61001 TGGCCGCACG GCTCCCCGGT CTGCGCGCCG CCGGGCCGGC CGTGCGGCGC 
61051 CGCCGCTCAC CGGTGCTGCA CGGACACGCC CGCCTCCCCG TCGCCGTCGC
61101 CCGGACGGCC CGTGACCTGC CCGCCACCGC ACCGCGGAAC TGAGGAGGGA
61151 GTGCCCCGAT GCGTATCCTG CTGACGTCGT TCGCGCACAA CACGCACTAC
61201 TACAACCTGG TCCCCCTCGG CTGGGCGCTG CGCGCCGCCG GGCACGACGT
61251 ACGGGTCGCC AGCCAGCCCT CGCTGACCGG CACCATCACC GGCTCCGGGC
61301 TGACCGCCGT CCCCGTGGGC GACGACACGG CCATCGTCGA GCTGATCACC
61351 GAGATCGGCG ACGACCTCGT CCTCTACCAG CAGGGCATGG ACTTCGTGGA
61401 CACCCGCGAC GAGCCGCTGT CCTGGGAACA CGCCCTCGGA CAGCAGACGA
61451 TCATGTCGGC CATGTGCTTC TCGCCGCTGA ACGGCGACAG CACCATCGAC
61501 GACATGGTGG CGCTGGCCCG TTCCTGGAAA CCGGACCTCG TCCTGTGGGA
61551 GCCCTTCACC TACGCGGGAC CCGTCGCCGC GCACGCCTGC GGCGCCGCCC
61601 ACGCCCGGCT GCTGTGGGGT CCCGACGTGG TCCTCAACGC ACGGCGGCAG
61651 TTCACCCGGC TGCTCGCCGA GCGCCCCGTC GAACAGCGCG AGGACCCGGT
61701 CGGCGAATGG CTCACGTGGA CGCTGGAGCG CCACGGCCTC GCCGCCGACG
61751 CGGACACGAT CGAGGAACTG TTCGCCGGGC AGTGGACGAT CGACCCCAGC
61801 GCCGGGAGCC TGCGGCTGCC GGTCGACGGC GAGGTCGTGC CCATGCGCTT
61851 CGTGCCGTAC AACGGCGCCT CGGTCGTCCC CGCCTGGCTC TCCGAGCCGC
61901 CTGCCCGGCC CCGGGTCTGC GTCACCCTCG GCGTCTCCAC CCGGGAGACC
61951 TACGGCACGG ACGGCGTCCC GTTCCACGAA CTGCTGGCCG GACTGGCCGA
62001 CGTGGACGCC GAGATCGTCG CCACCCTCGA CGCGGGGCAG CTCCCGGACG
62051 CCGCCGGTCT GCCCGGCAAT GTGCGCGTCG TCGACTTCGT GCCGCTGGAC
62101 GCCCTGCTGC CGAGCTGCGC CGCGATCGTC CACCACGGAG GCGCGGGAAC
62151 CTGTTTCACG GCCACCGTGC ACGGCGTCCC GCAGATCGTC GTGGCCTCCC
62201 TCTGGGACGC GCCGCTGAAG GCGCACCAAC TCGCCGAGGC GGGCGCCGGG
62251 ATCGCCCTGG ACCCCGGGGA ACTGGGCGTG GACACCCTGC GCGGCGCCGT
62301 CGTGCGGGTG CTGGAGAGCC GCGAGATGGC CGTGGCGGCG CGTCGCCTCG
62351 CCGACGAGAT GCTCGCCGCC CCCACCCCGG CCGCGCTCGT CCCCCGCCTC
62401 GAACGCCTCA CCGCCGCGCA CCGCCGCGCC TGATCCCGCC AAGGAGCCCC
62451 CATGAACCTC GAATACAGCG GCGACATCGC CCGGTTGTAC GACCTGGTCC
62501 ACCAGGGAAA GGGCAAGGAC TACCGGGCGG AGGCCGAGGA GCTGGCCGCG
62551 CTTGTCACCC AGCGCCGCCC CGGGGCCCGC TCCCTCCTCG ACGTGGCCTG
62601 CGGAACGGGG ATGCACCTGC GGCACCTCGG CGACCTCTTC GAGGAGGTGG
62651 CCGGGGTGGA GATGTCCCCC GACATGCTGG CCATCGCGCA GCGGCGCAAC
62701 CCGGAGGCCG GCATCCACCG GGGGGACATG CGGGACTTCG CCCTCGGCCG
62751 CCGCTTCGAC GCCGTGATCT GCATGTTCAG TTCCATCGGG CACATGCGCG
62801 ACCAGCGGGA ACTGGACGCG GCGATCGGCC GGTTCGCCGC GCACCTGCCG
62851 TCCGGCGGGG TCGTGATCGT CGATCCCTGG TGGTTCCCGG AGACGTTCAC
62901 ACCGGGGTAC GTCGGCGCGA GCCTCGTCGA GGCCGAGGGC CGCACCATCG
62951 CGCGCTTCTC CCACTCCGCG CTCGAGGACG GCGCGACCCG GATCGATGTG
63001 GACTACCTCG TCGGCGTGCC GGGGGAGGGG GTGCGGCACT TGAAGGAGAC
63051 CCATCGGATC ACGCTTTTCG GGCGTGCGCA GTACGAGGCG GCCTTCACCG
63101 CGGCGGGGAT GTCCGTCGAG TACCTCCCGC ACGCCGCCAC CGACCGCGGA
63151 CTCTTCGTCG GCGTCCAGGC CTGA

Figure 10

1 MKGIILAGGS GTRLRPLTGA LSKQLLPVYD KPMIYYPLSV LMLAGIRDIQ 
51 IITSKTHLEM FRSLLGDGSR IGISVGYAEQ EEPRGIAEAF LIGEEHIGDD 
101 PVALILGDNV FHGPGFSSVL ASTAARLDGC ELFGYPVKDP RRYGVGEVDA 
151 EGRLVSLEEK PEKPRSHLAV TGLYFYDNGV VDIARRLTPS PRGELEITDV 
201 NKVYLEQGRA RMTELGRGFA WLDMGTHSSL LQAGQYVQLL EQRQGVRISC 
251 VEEIALRMGY ISARQCHELG RELESSSYGR YLMDVAETLM SGPAA

Figure 11

1 MRLLVTGGAG FIGSHFVRQL LAGAYPDLAG ARTVWDKLT YAGNLANLDP 
51 VADHPSLEFV HADIRDAEVM SRWRGADW VHFAAESHVD RSIADASAFV
101 ETNVRGTQVL LQAAVEAGAG RFVHVSTDEV YGSIAEGSWR EEQPLAPNSP 
151 YAASKAASDL LALAYHRTYG LPVWTRCSN NYGPYQHPEK VVPLFATNLL 
201 DGLTVPLYSD GGNSRDWLHV DDHCRGISLV ATRGRPGEVY HIGGGTELTN 
251 RELTKRLLGL CGADASSVRH VADRPGHDLR YALDIGKITG ELGYAPRTDF 
301 TTGLADTVRW YAENRAWWEP LKKAAQEARR TD

Figure 12

1 VSTPSAPPVP GAPSPAGHPD EGLWVRRYRP VRDPELRLVC FPHAGGAATS
51 FAALARGLDE TVEALAVQYP GRQDRRHEPF IPSISGLVDQ WPEILRWAD
101 RPLALFGHSM GATVAFEVAR RLRGSGQASP VHLLVSGRRA PTVRRRDVAH
151 LLDDDALIAE IATLQGTEDA VLQDEELLRL ALPAIRNDYR AAGTYAYVPG
201 GALDCPVTVL TGDRDPDVPL EEARAWRELT TGPFALHTFA GGHFYLNDRM
251 DEVCRTIGDA LAGTATADTA TGTVPPRTAA DTSTGPVPPR TAADTAREPV
301 PPRSAPAPHG AARRRADAVR PGDPVDTARR VLVSARTADS AVTPFDGISG
351 WLAERLRAGR FDVSRVPFAE LRGWSFHPGT GNLHHASGRF FSVEGLHVRT
401 DRLPERGWTQ PIIVQPEVGL LGIVAREIDG VLHFLMQAKM EPGNVNVLQV
451 SPTVQATRSN FTGVHRGRDI RYLDLFMGPR RARVLVDSIQ SEQADWFLAK
501 RNRNMIVELA ADDDLDIGED FRWLTLGQLR RLLMLDNWN MDARSILACL
551 PTADADASAP SPVLRSFFGS PGAARHTTAE VLTWFTGVRA LRELVQNRVP
601 LDTVTADGWY RTPHEIAHES GRHFRVMAAE VSASSREVTS WTQPLIEPRL
651 PGLMALLVKS VDGVLHALVR ARVDVGHLNV AELAPTVQCR PQEHTGPRGL
701 PGPPYLEDVL SAPPQDVRYD AVQSEEGGRF FHAQNRYVIV EVPHDFPEDA
751 PDDFAWLSLG QLTGLLAHGN YLNIELRTLV ACAHTLY

Figure 13

1 MVNDPMPRGS GSGSVWLGG AGYVGRHVCA AFAARGRDW VVGRRPPEEP 
51 MPYRCVTLDL AGTDPAALAA ALDAERPDTI VNSVGSIWGR TDEQMWSATA 
101 VPTLRLLEAL ALMSARPRLV HLGSVLEYGP VTPGGSVGAD AVPRPDTAYG 
151 RSKLAASEAV LRGTSGGWVD GWLRVSNVS GPGTPRISLL GQVAERLLAA 
201 AGTGAEAWE LSRLRAHRDY VDVRDVADAV VAAARAPAVP VAVGIGRGEA
251 VAVRDLVGLF IEASGIPARV VERPAPGRAP GHREDWLRVD TGAARALLGW 
301 APRRSLRESV RDCWHDLVRA HRLPTTPSKH SGG

Figure 14

1 VTTYVWDYLA EYQNERADLL DAVETVFASG QLVLGPSVDG FEKEFADYHG 
51 LRHCGGVDNG TNAVKLGLQA LGVGPGDEW TVSNTAAPTV VAIDGTGATP 
101 VFVDVRAEDH LMDTDQVADV ITPRTKALLP VHLYGQCVDM APLRALAEQH 
151 GLWLEDCAQ AHGARHHGEL AGTLGDAAAF SFYPTKVLGA YGDGGAVLTD
201 DADVDRALRR LRYYGMEDVY YWQTPGHNS RLDEVQAEIL RRKLTRLDRY 
251 IEGRRAVARR YAEGLANLTG PGGLVLPSVT EGNDHVYYVY WRHPRRDDI 
301 IEALKSYGIS LNISYPWPVH TMTGFAHLGY AKGSLPVTER LADEIFSLPM 
351 YPGLAPDVQD KVIAALHEVL ATL

Figure 15

Figure 16

1 MRILLTSFAH NTHYYNLVPL GWALRAAGHD VRVASQPSLT GTITGSGLTA
51 VPVGDDTAIV ELITEIGDDL VLYQQGMDFV DTRDEPLSWE HALGQQTIMS
101 AMCFSPLNGD STIDDMVALA RSWKPDLVLW EPFTYAGPVA AHACGAAHAR
151 LLWGPDWLN ARRQFTRLLA ERPVEQREDP VGEWLTWTLE RHGLAADADT
201 IEELFAGQWT IDPSAGSLRL PVDGEWPMR FVPYNGASW PAWLSEPPAR
251 PRVCVTLGVS TRETYGTDGV PFHELLAGLA DVDAEIVATL DAGQLPDAAG
301 LPGNVRWDF VPLDALLPSC AAIVHHGGAG TCFTATVHGV PQIWASLWD
351 APLKAHQLAE AGAGIALDPG ELGVDTLRGA WRVLESREM AVAARRLADE
401 MLAAPTPAAL VPRLERLTAA HRRA

Figure 17

1 MNLEYSGDIA RLYDLVHQGK GKDYRAEAEE LAALVTQRRP GARSLLDVAC 
51 GTGMHLRHLG DLFEEVAGVE MSPDMLAIAQ RRNPEAGIHR GDMRDFALGR 
101 RFDAVICMFS SIGHMRDQRE LDAAIGRFAA HLPSGGWIV DPWWFPETFT 
151 PGYVGASLVE AEGRTIARFS HSALEDGATR IDVDYLVGVP GEGVRHLKET 
201 HRITLFGRAQ YEAAFTAAGM SVEYLPHAAT DRGLFVGVQA